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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 

corresponding 

residue of 
sequence 


Predicted snd 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 

Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine , R=Arginine , 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X= Unknown , *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








IRI EGH HHDWACDFS PDGALLATAS YDTRVYIWDPHNGDILM 
EFGHLFPPPTPIFAGGANDRWVRSVSFSHDGLHVASLADDKMVR 
FWRIDEDYPVQVAPLSNGLCCAFSTDGSVLAAGTHDGSVYFWAT 
PRQVPSLQHLCRMSIRRVMPTQEVQELPIPSKLLEFLSYRI 


590S 


146 


2038 


REGAGSGRMASGA\YNPYIEIIEQPRQRGMRFRYKCEGRSAGSI 
PGEHSTDNNRTYPSIQIMNYYGKGKV\RITLVTK\NDPYKPHPH 
DLVGKDCRD \GYYEAE FGQE \RRP \ LFFQN\LGIRCVKKKEVKE 
A\ I ITR\ I KAG INPFDVP* KQLND I EDCDLDWRLWFRVFLPDG 
HGNL\TTALPPV\VSSPIYDNRAPNTAEIiRVCRVNKWCGSVRGG 
DE I FLLCDKVQKDDI EVRFVLNDWEAKGI FSQADVHRQVAIVFK 
TPPYCKAITEPVTVKMQLRRPSDQEVSESMDFRYLPDEKDTYGN 
KAKKQKTTLLFQKLCQDHVETGFRHVDQDGLELLTSGDPPTLAS 
QSAGITVNFPERPRPGLLGSIGEGRYFKKEPNLFSHDAWREMP 
TGVSSQAESYYPSPGPISSGLSHHASMAPLPSSSWSSVAHPTPR 
SGNTNPLSSFSTRTLPSNSQGIPPFLRIPVGNDLNASNACIYNN 
ADDIVGMEASSMPSADLYGISDPNM1SNCSTOMMTTSSDSMGET 
DNPRLLSMNLENPSCNSVLDPRDLRQLHQMSSSSMSAGANSNTT 
VFVSQSDAFEGSDFSCADNSMINESGPSNSTNPNSHVFVQDSQY 
' 3QLSDSFPYEFFQV 


5907 


99 


1873 


"i^SVS , 1 1 1 /RXr-HRKISWPYPQPAKQNGK 

RQKRRTIESYCQDVLRRQEEFEHKEEVLQELNMFPQLDDEATRK 
AYYKEFRKWEYSDVILEVLDARDPLGCRCFQMEEAVLRAQGNK 
JWEKWLDYLRNELPTVAFKASTQHQVKNL 
II CGVPVDQASESLTiKS KACFGAENLMRVLGNYCRLGEVRTHIR 
VGWGLPNVGKSSLINSLKRSRACSVGAVPGITKFMQE V Y L.DKF 

NLEEISNYYGVSGFQTTEHFLTAVAHRLGKKKKGGLYSQEQAAK 
AVLADWVSGKISFYIPPPATHTLPTHLSAE1VKEMTEVFDIEDT 
EQANEDTMECLATGESDELLGDTDPLEMEIKLLHSPMTKIADAI 
ENKTTVYKIGDLTGYCTNPNRHQMGWAKRNVDHRPKSNSMVDVC 
SVDRRSVLQRIMETDPLQQGQALASALKNKKKMQKRADKIASKL 
SDS.MMSALDLSGNADDGVGD 


5908 


247 


975 


--CGIKKRGEGSG5PSPASGGFQLGCQIPEPSLPSEEETHPHTRA 
HTRTLRATLTRRPPRSHSTRLRFFMPLDGDGGLASWK/PMRER* 
GWRRPAKAAGASLGVAATGKRGCRMSKRYLQKATKGKLLIIIFI 
VTLKGKWSSANHHKAHHVKTGTCE WALHRCCNKNKI EERSQT 
VKCSCFPGQVAGTTRAAPSCVDASIVEQKWWCHMQPCLEGEECK 
VLPDRKGWSCSSGNKVKTTRVTH 


5909 




5002 


PAIFGSTIIWAPGSHSAARADGRHGSLPSQSQAPGALCGARAPP 
SSNLRADRSMICAQARAGKNLYHNRFLG1AAMAFPSRNSQSLRR 
CKEPIRYSYNPDQFHNMDLRGGPHDGVTIPRSTSDTDLVTSDSR 
STLMC-RSSYYSIGHSQDLVIHWDIKEEVDAGDWIGMYLIDEVLS 
ENFLDYKNRGVNGSHRGQIIWKIDASSYFVEPETKICFKYYHGV 
SGALRATTPSVTVKNSAAPIFKSIGADETVQGQGSRRLISFSLS 
DFQAMGLKKGMFFNPDPYLKISIQPGKHSIFPALPHHGQERRSK 
I IGNTVNPIWQAEQFSFVSLPTDVLE IEVKDKFAKSRP I IKRFL 
GKLSMPVQRLLERHAIGDRWSYTLGRRLPTDHVSGQLQFRFEI 
TS3IHPDDEEISLSTEPESAQIQDSPMNNLMESGSGEPRSEAPE 
SSZSWKPEQLGEGSVPDRPGNQSIELSRPAEEAAVITEAGDQGM 
VSVGPSGAGELLAQVQKD1QPAPSAEELAEQLDLGEEASADLLE 
DG3APASTKEEPLEEEATTQSRAGREEEEKEQEEEGDVSTLEQG 
EGRLQLRASVKRKSRPCSLPVSELETVIASACGDPETPRTHYIR 
IHTLLHSMPSAQGGSAAEEEDGAEEESTLKDSSEKDGLSEVDTV 
AADPSALEEDREEPEGATPGTAHPGHSGGHFPSLANGAAQDGDT 
HPST3SESDSSPRQGGDHSCEGCDASCCSPSCYSSSCYSTSCYS 
S S CYSASCYS PSCYNGNRFASHTRFSSVDSAKI SESTVFS SQDD 
EEEESSAFESVPDSMQSPELDPESTNGAGPWQDEIAAPSGHVER 
SPEGLES PVAGPSNRREGECPILHNSQPVSQLP3LRPEHHHYPT 
IDEPLPPNWEARIDSHGRVFYVDHVNRTTTWQRPTAAATPDGMR 
RSGSIQQMEQLNRRYQNIQRTIATERSEEDSGSQSCEQAPAGGG 
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SEQ 
NO: 


Predicted 
nucleotide 
corresponding 

residue of 
sequence • 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 

sequence 


Amino acid segment containing signal peptide • 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Slycine, 
H=Hisridine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=31utamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








GGGGSDSEAESSQSSLDLRREGSLS PVNSQKITLLLQSPAVKFI 
TIS-pEFFTVLHANYSAYRVFTSSTCLKHMIiKVRRDARNFERYQH 
NRDLWFINMFADTRLELPRGWEIKTDQQGKSFFVDHNSRATTF 
IDPRIPLQNGRLPNHLTHRQHLQRLRSYSAGEASEVSRNRGASL 
LARPGHSLVAAIRSQHQHESLPLAYHDKIVAFLRQPNIFEMLQE 
RQPSLARNHTLREKIHYIRTEGNKGLEKLSCDADLVILLSLFEE 
EIMSYVPLQAAFHPGYSFSPRGSPCSSPQNSPGLQRASARAPSP 
YRRDFEAKLRNFYRKLEAKGFGQGPGKIKLIIRRDHLLEGTFNQ 
VMAYSRKELQRNKLYVTFVGEEGLDYSGPSREFFFLLSQELFNP 
YYGLFEYSANDTYTVQISPMSAFVENHLEWFRFSGRILGXbALI 
HQYLLDAFFT\RPFYKALL\RLPC\D\LSDLEYLDEEFHQSLQW 
MKDNNITDILDLTFTVNEEVFGQVTERELKSGGAMTQVTEKNKK 
EY1ERMVKWRVERGWQQTEALVRGFYEWDSRLVSVFDARELE 
LVIAGTAEIDLNDWRNNTEYRGGYHDGHLVIRWFWAAVERFNNE 
QRLRLLQFVTGTSSVPYEGFAAPPWEPMGLRRFLP+KKWGKITS 
LPPRG\HTCLQPDWDLPTVSPRTPMLYEK\LLTA\VEETSTFGT 


5910 


1526 


446 


r > rr j i it >r le >l - v i 

AAQLLRALGPVEIALTSILTLLALGSIAIFLEDAVYLYKNTLCP 
IKRRTLLWKSSAPTWSVLCCFGLWIPRSLVLVEMTITSFYAVC 
FYLLMIiVMVEG FGGKE AVLR TT iRDTPMMVHTGPCCCCGP CCPRL 
LLTRKKLQ\R*CWALSNTPS*R*R*PWWACFSSPTASMTQQTFL 
RGAQLYGSTLSSA/CSTLLALMTLGIISRQARLHLGEQNMGAKF 
ALFQVLL I LTALQPS I FSVLANGGQIACS PPYSSKTR3QVMNCH 
LLri,ETFI,KTVLTRMYYRRKDIIKVGYETF3SPDLDLNLKALRWM 
AWTMKGCCTH 


5911 


109 


595 


Q-jPLAPC^QGKGLEMRSPKPQSFIIRSSHSGAGIjIjVKNPSTPVF 
CGHRRGGAAFKYKPTPWGPEQRPTGQKHMRGGVSLLSPRLECS 
GTI SAHCNLRLPS S SNS PAPAS * LAG I TG VCHHAQL I F VELVET 
GFHHVGQAGLELL/NWIHLPRPPKVLGLQA 


5912 


S24 




MILNKALMLGALALTTVMSPCGGEDIVADHVASYGVNLYQSYGP 
SGQYSHEFDGDEEFYVDLERKETVWQLPLFRRFRRFDPQFALTN 
ZAVLKHNLNIVIKRSNSTAATNEVPEVTVFSKSPVTLGQPWTLI 
CLVDNIFPPWNITWLSNGHSVTEGVSETRPSSPKSDHFLLQDQ 
VTS PS FPFE* * DL * TAKVEQLGAWFEPLLKHWGAE IPTTL 


5913 


45 


1198 


QbRMAGAEGAAGRQSELEPWSLVDVLEEDEELENEACAVLGGS 
DSEKCSYSQGSVKRQALYACSTCTPEGEEPAGICLACSYECHGS 
HKLFELYTKRNFRCDCGNSKFKNLECKLLPDKAKVNSGNKYNDN 
FFGLYCICKRPYPDPEDEIPDEMIQCWCEDWFHGRHLGAIPPE 
!5Gn?".IEKVCQACMKRCSFLWAYAAQLAVTKIST\GMMDWCGTLM 
E * / DBQEVI KP3NGEHQDSTLKEDVPEQGKDDVREVKVEQNSBP 
CAGSSSESDLQTVFKNESLNAESKSGCKLQELKAKQLIKKDTAT 
YWPLNWRSKLCTCQDCMKMYGDLDVLFLTDEYDTVLAYENKGKI 
AQATDRSDPLMDTLSSMNRVQQVELIC/GIQ*FED 






124 


NLGGSELPPEEALFIQVASMNQRRVDFYLASIEDMLVAl/GGRN 
ENGALSSVETYSPKTDSWSYVAGLPRFTYGHAGTIYKDFVYISG 
GHDYQIGPYRKNLLCYDHRTDVWEERRPMTTARGWHSMCSLGDS 
IYSIGGSDDNIESMERFDVLGVEAYSPQCNQWTRVAPLLHANSE 
SGVAWEGRIYILGGYSWENTAFSKTVQVYDREADKWSRGVDLP 
KAIAGGSACFIAP*SLGQRTRKRKAKARGTRTGASDPSCASWDH 
PHRHLPGLCRPAATS 


5915 


1604 


703 


E PGRPTRPLKLGRRRKRARI IQAPHCHSPRPRTCPPGALQAPEA 
PASRAEGPVAVWNGHTEGPAPARSAPKEPPGLPRPLGSFPCPT 
PQEDFPALGGPCPPRMPPSPGFSAWLLKGTPPPPPPGLVPPIS 
KPPPGFSGLLPSPHP\PVSPAPPPPPPQK/RPRLLPAP/PGLPS 
PHKIiPGEEPSAHPVHQGLPAERRGPLQRVQEPLRGVQTGPDIiRS 
PVLQELPC-FAGGEFPEGL* *AAGPAAH 


. 5916 


256 


633 


SPRMWEIWGPWHRWESFSLEGEWPSRIPEPSPDSTKGTSGKGCR 
TVTGAVHRHLNHVAGIIPWVIiHSQLKPTAATAQDQWTSQQYPDH 
PTRLI LQ *NQATADKNN*TTALLQPHQRL\VSPRMAEA 


5917 


1343 


827 


AHQILTYLEP/' ICLWNYNK-LTVFLTKSVLE1 *KFIHTPQTYR | 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 

corresponding 
to first 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 


{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








?*NDFFGIKEVYVSRRLRKTSF/RLAVTFLEQAWSKECVPVDQ 
FMEHLLPSLLSLASDPVPNVRVLLAKALRQMLLBKAYFRNAGNP 
HLE VIEE T I LALQS DRDQDVS F FAALE P KRRNI I DTAVLE KQN 


5918 


13 


1247 


EGAQVARRRSRRQWRAGRCGRGRGGRRAERTGGRGPPGRPRPLP 
PGPARRGRRRMETPFYGDEALSGLGGGASGSGGTFASPGRLFPG 
APPTAAAGSMMKKDALTLSLSEQVAAALKPAPAPASYPPA\ADG 
APSAAPPDGLLASPDLGLLKLASPELERLIIQSNGLVTTTPTSS 
QFLYPKVAASEEQEFAEGFVKALEDLEKQNQLGAGRAAAAAAAA 
AGGPSGTATGSAPPGELAPAAAAPEAPVYA\NLSSY\AGGCRGL 
RGGAAT\VAPAAEPVPFPPPPPPGALGPRR?/RLALQGRRPQTV 
PDVP\SFGESP\PLSPIET\DTPRRI\KAKRKRL\RNPQIRAPK 
PASRKLGAQSRALERESEDPS * SPEHC-SLASTASLLREQVAQLK 
QKVLSHVNSGCQLLPQHQVPAY 


5919 


1 


1254 


TSVQGDSQGTPTSSQGSINMEHWISQAIHGSTTSTTSSSSTQSG 
GSGAAHRLADVMAQTHIENHSAPPDVTTYTSEHSIQVERPQGST 
GSRTAPKYGNAELMETGDGVPVSSRVSAKIQQLVNTLKRPKRPP 
LREFFVDDFEELLEVQQPDPNQPKPEGAQMLAMRGEOLGWTNW 
PPSLEAALQRWGTISPKAPCLTTMDTNGKPLYILTYGKLWTRSM 
KVAYSILHKLGTKQEPMVRPGDRVALVFPNNDPAAFMAAFYGCL 
LAEWPVPIEVPLTRKDAGSQQIGFLLGSCGVTVALTSDACHKG 
LPKSPTGEIPQFKGWPKLLWFVTESKHLSKPPRDWF\PHIKDAN 
NDTAYIEYKTCK\DGSVLGVTVTRTALLTHCQALTQACGYTEAE 

tivnvldfkkdvglwhgiltsvmmmmhvisipyslmkvnpIjSWI 

QKVCQYKAKVACVKSRDMHWALVAIIRDQRDINLSSLRMI.IVADG 
ANPWSISECDAFLNVFQSKGLRQEVICPCASSPEALTVAIRRPT 
DDSNQPPGRGVLSMHGLTYGVIRVDSEEKLSVLTVQDVGLVMPG 
AIMCSVKPDGVPQLCRTDEIGELCVCAVATGTSYYGLSGMTKKT 
FEVFAMTSSGAPISEYPFIRTGLLGFV3PGGLVFWGKMDGLMV 
VSGRRHWADDIVATALAVEPMKFVYRGRIAVFSVTVLHDERIVI 
VAEQRPDSTEEDSFQWMSRVLQAIDSIHQVGVYCLALVPANTLP 
KTPLGGIHLSETKQLFLEGSLHPCNVLMCPHTCVTNLPKPRQKQ 
PEIGPASVMVGNLVSGKRIAQASGRDLGQIEDNDQARKFLFLSE 
VLQWRAQTTPDHILYTT.LNCRGAIANSLTCVQLHKRAEKIAVML 

QNIATTLPTVKMIVEVSRSACLMTTQLICKLLRSREAAAAVDVR 
TKPLI LDTDD* PKKRPAQ I CKPCNPDTLAYLDFS VSTTGMLAGV 
KKSHAATSAFCRSIKLQCELYPSREVAICLDPYCGLGFVLWCLC 
SVYSGHQSILIPPSELETNPALWLIAVSQYKVRDTFCSYSVMEL 
GTKGLGSQTESLKARGLDLSRVRTCVWAEERPRIALTQSFSKIj 
FKDLGLHPRAVSTSFGCRVNLAICLQGTSGPDPTTVYVDMRALR 
HERVRLVERGSPHSLPLMESGKILPGVRIIIANPETKGPLGDSH 
LGEIWVHSAHNASGYFTIYGDESLQSDHFNSRLSFGDTQTIWAR 
TGYLGFLRRTELTDANGERHDALYWGALDEAMELRGMRYHPID 
IETSVIRAHKSVTECAVFTWTNLLWWELDGSEQEALDLVPLV 
TICWLEEHYLIVGVWWDIGVIPINSRGEKQRMHLRDGFLADQ 
LDP I YVAYNM 


5920 


1381 


1499 


L A.IA3VSRI PP* LFPPLHPTFLSLWCLHHKLP /HPPGASM 
VRPPWPRRPPAHISSVRQASTQVPRTVPHTQRVANIGTQTTGP 
SGVGCCTPGRPLLPCKCSSAAHSTYRVQEPAVHIPGQEPLTASM 
LAAAPLHEQKQMI GERLYPL IHDVHTQLAGKITGMLLE IDNS EL 
LLMLESPESLHAKIDEAVAVLQAHQAMEQPKAYMH 


5921 


727 


157 


I •'"^GE + GLWGQLGGLPKETPLKPMDAFTGSGLKRKFDDVDV 
GS S VSNSDDE IS S SDSADS CDSLNPPTTASFTPTS ILKRQKQLR 
RKI P -1 FDQVTVYYFARRQGFTSVPSCGGSSLGMAQRHNSVRS YT 
LCEFAQEQEVNHREILREHLKEEKLHAKKI1KLTKNGTVESVEAD 
GLTLDDVSDEDIDVENVEVDDYFFLQPLPTKRRRALLRASGVHR 
IDAEEKQELRAIRLSREECGCDCRLYCDPEACACSQAGIKCQVD 
RMSFPCGCSRDGCGNMAGRIEFNPIRVRTHYLHTIMKLELESKR 
Q\GAAQQPQ\*GALPDCQLQPDRSTGL*DPSWIGSKGLSFTGKG 
AAATHL I I LRVIENRGAEGKRK 


5922 


2475 


495 


S YSNWGLFPS VFIQVPRS RTGNLKP IFL F YS Y YE \ CMETLKG \ T 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 

corresponding 
to first 

residue of 

sequence 


Predicted end 

nucleotide 

location 

cor re spending 

to first 

amino acid 

amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion! 








CLYNATQYXVCSPRNDRPDACYNPSEPAATTVFEIRTGLLLGDT" 
SK=ITRTSEKEIPKQ1TLRFEACAAINSKKLEIGCGSLN*ERS* 
RVEMKYVCHESGVCKNCAYMPCVI*AT*KKNKNDSVYLQKGEAN 
PSCAAGHCNPLELIITNPLDPHWKKGERVTLGINRTGLKPQWI 
LIKGEVHKCSPKPVFQTFYEELNLPAPELLKKTKNLFLQLAENV 
I FLLNGTS C YVRG3TTIGDRWPWEA* ELVPTDPAPD I IP I * KAE 
ASNF*VLKTSIIRQYCIAREGKDFIIPVGKPNCIGQKLYNSTTK 
TIT**DLNHTEKNPFSKFSKLKTA*AHAESH*DWTVPSGLY*IC 
RHRAYFRLPNKWADSCVIGTIKPS FFLLP IKMGELLGFSVYASR 
EKKGIVIGNWKDNEWPRERIIQYYGPATWAQDGSWGYR/TP/VY 
MLNWI IRLQAILEI ISNETGRALTVLAWQETQMRNAIYQIIRLAL 
DYLLVAEGGVCRKFNLTNCCLQINDQGQWKNIVRDMTKLAHVP 
IQVWHKFDPESLFGKWFPAIG3FKTLIVGVLLVIRTCLLLPCVL 
PLLFQMIKGIVATLVHQKTSAHWYMNHYRSISQRDSKSEDESE 
NSH 


5923 


137 


638 


QLCGRRGQRFRTS I KRMHPI * RTCPNTNL/ I ILLSQENTQIRDL 
QQBNRELWISLEEHQDALELIMSKYRKQMLQLMVAKKAVDAEPV 
LKAHQSHSAEIESQIDRICEMGEVMRKAVQVDDDQFCKIQEKLA 
QLELEHKELRELLSISSESLQARKENSMDTASQAIK 


5924 


274 


2145 


'> 1 1 1 3LSLS CKGSWETQFSNHLNSLTPPTS VRRM 
PLITTVTLLKMVARHHMKLLCSKAFSTQLQQKI FLHSQMG I HHQ 
SVCMKLKPNTSHI I S ILMGQPMALVQLETLAPLT 1 1 1 QKFQTQD 
HMKFWKNLPLHSHHLTPS VPQTVIPKKTGSPE I KLKI TKT I QNG 
RELFESSLCGDLLNEVQASE\Q*NQSIESRKEKRKKSNKKDSSR 
SEERKSHKIPKLEPEEQNRPNERVDTVSEKPREEPVLKEGSPSS 
AIMTIFCSNNGSVHW\FKFQVGDLVWEKVGTYFWWPCMVSSDPQL 
EVHTKINTRGAREYHVQFFSNQPERAWVHEKRVREYKGHKQYEE 
LLAEATKQASNHSEKQKIRKPRPQRERAQWDIGIAHAEKALKMT 
REERIEQYTFIYIDKQPEEALSQAKKSVASKTEVKKTRRPRSVL 
NTQPEQTNAGEVASSLSSTEIRRHSQRRHTSAEEEEPPPVKIAW 
KTAAAF.KSLPASI7MHKGSLDLQKCNMSPWKIEQVFALQNATG 
DGKFI DQFVYSTKG I GNKTE IS VRGQDRL I ISTPNQRNEKPTQS 
VSSPEATSGSTGSVEKKQQRRSIRTRSESEKSTEWPKKKIKKE 
QVGFLHVES 






1911 


MMTAESREATGLSPQAAQEKDGIVIVKVEEEDEEDHMWGQDSTL 
QDTPPPDPEIFRQRFRRFCYQNTFGPREALSRLKELCHQWLRPE 
INTKEQILELLVLEQFIjSILPKELQVWLQEYRPDSGEEAVTLLE 
DLSLDLSGQQVPGQVHGPEMLARGMVPLUPVQESSSFDLHHEAT 
QSHFKKSSRKPRLLQSRALPAAHIPAPPHEGSPRDQAMASALFT 
ADSQAKVKIEDMAVSLILEEWGCQNLARRWLSRDNRQENYGSAF 
PQ.GGENRNENEESTSKAETSEDSASRGETTGRSQKEFGEKRDQE 
GKTGERQQKNPEEKTRKEKRDSGPAIGKDKKTITGERGPREKGK 
GLGRSFSLSSNFTTPEEVPTGTKSHRCDECGKCFTRSSSLIRHK 
I IHTGEKPYECSECGKAF\SLNS \NLVLHQRI \HTGEKPHECNE 
CGKAFSHSSNLILHQRIHSGEKPYECNECGKAFSQSSD\LTKHQ 
RIHT ^EKPYECSECGKAFNRNSYLILHRRTOTREKPYKCTKCGK 
\AFTRSSTLTLHHRIHARERASEYSPASLDAFGAFLKSCV 


5926 


2 


233 


' L 1LKQGSQPGSPPAT/CEPEAPPVYQAPCQSCPEPPGAHEP 
5DSPHHTPVHPPPEHSAACPAPATCCPPPRSSMS 


5927 


4146 


1248 


KUFSKFGSQALYQLKRPASGQNSISVMPAQKITKPAAKYGIPLA 
vv;;ygdKKLHEKKPLQKHKQAHQTPEKRWTGEERRKISEEAAR 
KRRLEFIEKEKXQKDQIISLMKAEQMKRQEKERLERINRAREQG 
' 1 - jSAGGSGEVKAPFLGSGGTIAPSSFSSRGQYEHYHAIFDQ 
KQQQRAEDN3AKHKREIYGRGLPERQKGQLAVERAKQVEEFLQR 
KREAMQNK7ARAEGHMGILQNLAAMYGGRPSSSRGGKPRNKEEEV 
YLARLRQIRLQNFNERQQIKAKLRGEKKEANHSEGQEGSEEADM 
RRKK\IESLKAHANARAAVLKEQLERKRKEAYEREKKVWEEHLV 
AKGVKSSDVSPPLGQHETGGSPSKQQMRSVISVTSALKEVGVDS 
SLTDTRETSEEMQKTNNAISSKREILRRLNENLKAQEDEKGKQN 
LSDTFEINVHEDAKEHEKEKSVSSDRKKWEAGGQLVIPLDELTL 
DTSFSTTERHTVGEVIKLGPNGSPRRAWGKSPTDSVLKILGEAE 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

cor re spondi ng 

residue of 
sequence 


Predicted end 
nucleotide 

corresponding 
residue of 


Ammo acid segment containing signal peptide 
!A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine," 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertionl 








LQLQTELLENTTIRSEISPEGEKYKPLITGEKKVQCISHEINPS 
AIVDSPVETKSPEFSEASPQMSLKLEGNLEEPDDLETEILQEPS 
GTHKDE\SLPCriTDVWISEEKETKETQSADRITIQENEVSEDG 
VSSTV3QLSDIHIEPGTNDSQHSKCDVDKSVQPEPFFHKVVHSE 
HLNLVPQVQSVQCSPEESFAFRSHSHLPPKNKNKNSLLIGLSTG 
LFDANMPKMLRTCSLPDLSKLFRTLMDVPTVGDVRQDNLEIDEI 
EDENIKEGPSDSEDIVFESTDTDLQELQASMEQLLREQPGEEYS 
EEEESVLKNSDVEPTANGTDVADEDDNTPSSESALNEEWHSDNSD 
GEIAS3CECDSVFNHLEE1RLHLEQEMGFEKFFEVYEKIKAIHE 
DEDENI EI CSKI VQNI LGNEHQHLYAKILHLVMADGAYQEDNDE 


5928 


414S 




KHFSKFGSQALYQLKRPASGQNSISVMPAQKITKPAAKYGIPLA 
YKKYGDKKLHEKKPLQKHKQAHQTPEXRVNTGEERRKISEEAAR 
KRRLEFIEKEKKQKDQIISLMKAEQMKRQEKERLERINRAREaG 
WRNVLSAGGSGEVKAPFLGSGGTIAPSSFSSRGQYEHYHAIFDQ 
MQQQRAEDNEAKWKREIYGRGLPERQKGQLAVERAKQVEEFLQR 
KREAMQNKARAEGHMGILQNLAAMYGGRPSSSRGGKPRNKEEEV 
YLARLRQIRLQNFNERQQIKAKLRGEKKEANHSEGQEGSEEADM 
RRKK\IESLKAHANARAAVLKEQLERKRKEAYEREKXVWEEHI,V 
AKGVKSSDVSPPLGQHETGGSPSKQQMRSVISVTSALKEVGVDS 
SLTDTRETSEEMQKTNNAISSKREILRRLNENLKAQEDEKGKQN 
LSDTFEIWVHh'DAKEHEKEKSVSEDRKKWEAGGQjVIPLDELTL 
DTSFSTTERHTVG3VIKLGPNGSPRRAWGKSPTDSVLKILGEAE 
LQLQTELiLENTTIRSEISPEGEKYKPLITGEKKVQCISHEINPS 
AIVDSPVETKSPEFSEASPQMSLKLEGNLEEPDDLETEILQEPS 
GTNKDE\SLPCTITDVWISEEKETKETQSADRITIQETOEVSEDG 
VSSTVDQLSDIHIEPGTNDSQHSKCDVDKSVQPEPFFHKWHSE 
HLNLVPQV3SVQCSPEESFAFRSHSHLPPKNKNKNSLLIGLSTG 
LFDANNP KMLRTCSLPDLS KLFRTLMDVPTVGDVRQDWLE I DE I 
EDENIKEGPSDSEDIVFEETDTDLQELQASMEQLLREQPGEEYS 
EEEESVLKNSDVEPTANGTDVADEDDNPSSSSALIIEEWHSDNSD 
GEIASECECDSVFNHLEELRLHLEQEMGFEKFFEVYEKIKAIHE 
DEDEWI E I CSKI VQNI LGNEHQHLYAKI LHLVMADGAYQEDNDE 


5929 


3 


1558 


LDFSMTTQLPAYVAILLFYVSRASCQDTFTAAVYEHAAILPNAT 
LTPVSREEALALMNRNLDILEGAITSAADQGAHIIVTPEDAIYG 
WNFNRDSLYPYLEDIPDPEVNWIPCNNRNRFGQTPVQ3RLSCL\ 
AKNNSIYWANIGDKKPCDTSDPQCPPDGRYQYNTDWF\DSQG 
KLVARYHKQNLFMGENQFNVPKEPEIVTFNTTFGSFGIFTCFDI 
LFHDPAVTLVKDFHVDTIVFPTAWMNVLPHLSAVEFHSAWAMGM 
RVNFLASNIHYPSKICMTGSGIYAPNSSRAFHYDMKTEEGKLLLS 
QLDSHPSHSAVVNWTSYASSIEALSSGNKEFKSTVFFDEFTFVK 
LTGVAC-NYTVCQKDLCCHLSYKMSENIPNEVYALGAFDGLHTVE 
GRYYLCICTLLKCKTTNLNTCGDSAETASTRFEMFSLSGTFGTQ 
YVFPEVLLSENQLAPGEFQVSTDGRLFSLKPTSGPVLTVTLFGR 
LYE KDWASNAS SGLTAQAR I I ML I VI AP I VCS LSK 


5930 


113 


6082 


P. J N - _ J W I V P FTMAQRTGLEDP ER YLFVDRAVI YNPATQADWTAK 
KLVWIPSERHGFEAASIKEERGDEVMVELAENGKKAMVWKDDIQ 
KMNPPKFSKVEDMAELTCLWEASVLHNLKDRYYSGLIYTYSGLF 
CWINPYKNLPIYSENIIEMYRGKKRHEMPPHIYAISESAYRCM 
LQDREDQSILCTGESGAGKTENTKKVIQYLAHVASSHKGRKDHN 
IPGE\LERQLLQAKPILESFGNARTVQNDNSSRFGKFIRINFDV 
TG Y I VGANI ETYLLEKSRAVRQAKDERTFHI FYQLLSG\AGEHL 
K£D1LLEGFNNYRFLSKGYIPIPGQ\QDKGNFRGDPGEAMHIMG 
FSHEEILSMLKWSSVLQFGNISFKKERNTDQASMPENTVAQKL 
CHLLGMNVMEFTRAILTPRIKVGRDYVQKAQTKEQADFAVEALA 
f fl n ERLFR WLVHR INKALDRTKRQGAS F I G I LDI AGFE I FELN 
E FEQLCINYTNE KLQQLFNHTMF I LEQEE YQREGI EWNF I D FGL 
ELQPCIDLIERPANPPGVLALLDEECWFPKATDKTFVEKLVQEQ 
GSIISKFQKPRQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLND 
NVATLLHQSSDRFVAELWKDVDRIVGLDQVTGMTETAFGSAYKT 
KKGMFRTVGQLYKESLTKLMATLRNTNPNFVRCIIPNHEKRAGK 
LDPHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILTP 



403 
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corresponding 



cid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 



S = Se: 



/=Valii 



W=Tryptophan, Y=Tyros. 

Codon, /=possible nuc! 

\=possible nucleotide insertion) 

naipkgfmdgkqacermiraleldpnlyrigqskiffragvlah 
leeehdlkitdiiiffqavcrgylarkafakkqqqlsalkvlqr 
ncaaylklrhwqwwrvftkvkpllqvtrqeeelqakdeellkvk 
ekqtkvegeleemerkhqqlleeknilaeqlqaetelfaeaeem 
rariaakkqeleeilhdlesrveeeeernq1lqnekkkmqahiq 
dleeqldeeegarqklqlekvtaeakikkmeeeillledqnskf 
:b:ekklmedriaecssqlaeeeekakwlakirnkqevmisdlee 
rlkkeektrqelekakrkldgettdlqdqiaelqaqidelklql 
akkeeelqgaiargddetlhknnalxwrelqaqiaelqedfes 
ekasrnkaekqkrdlseelealkteledtldttaaqqelrtkre 
qevaelkkaleeetknheaqiqdmrqrhataleelseqleqakr 
fkanleknkqgletdnkelacevkvlqqvkaesehkrkkldaqv 
qe lhakvsegdrlr veiae kas klqne ldnvs tllee ae kkg i k 
fakdaaslesqlqdtqellqeetrqklnlssrirqleeeknslq 
ecqeeeeearknlekqvlalqsqladtkkkvdddlgtiesleea ' 
kkkllkdaealsqrleekalaydklektknrlqqelddltvdld 
hqrqvasnlekkq\kkfdqllaeeksisaryaeerdraeaeare 
ketkalslaraleealeakeeferqijkqlradmedlmsskddvg 
knvkelekskraleqqv\eemrtqleeledelqj - l edaklr] ,e\ 
xmqamkaqferdlqtrdeqneekkrllikqvreleaelederkq 
ralavaskkkmeidlkdleaqieaaiikardev1kqlrklqaqmk 
dyqreleearasrdeifaoskesekklksleaeilqlqeelass 
2rarrkaeqerdeladeitnsasgksalldekrrleariaqlee 
3leeeqsnmellndrfrkttlqvdtlnaelaaersaaqksdhar 
qqlerqnkelkaelqelegavkskfkatisaleakigqleeqle 
qeakeraaakklvrrtekklkeifmqvederriadqykeqmeka 
marmkqlkrqleeaeeeatranasrrklqrelddateaneglsr 
evstlknrlrrggpisfsssrsgrrqlhlegaslelsdddtesk 
tsdvnetqppqse 



RG.MCFWIVPFTMAQRTGLEDPERYLFVDRAVIYNPATQADWTAK 
KLVWIPSERHGFEAASIKEERGDEVMVELAENGKKAMVNKDDIQ 
KMNPPKFSKVEDMAELTCLNEASVLKNLKDRYYSGLIYTYSGLF 
CWINPYKNLPIYSENIIEMYRGKKRHEMPPHIYAISESAYRCM 
LQDREDQS I LCTGESGAGKTENTKKVIQ YLAHVAS SHKGRKDHN 
JPGE\LERQLLQANPILESFGNARTVQNDNSSRFGKFIRINFDV 
TGYIVGANIETYLLEKSRAVRQAKDERTFHIFYQLLSG\AGEHL 
KSDLLLEGFNNYRFLSNGY I P I PGQ\QDKGNFRGDPGEAMHIMG 
FSHEEILSMLKWSSVLQFGNISFKKERNTDQASMPENTVAQKL 
CHLLGMNVMEFTRAILTPRIKVGRDYVQKAQTKEQADFAVEALA 
KATYERLFRWLVHR INKALDRTKRQGAS FIGILD IAGFE I FELN 
SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGL 
DLQPCIDLIERPANPPGVLALLDEECWFPKATDKTFVEKLVQEQ 
GSHSKFQKPRQLKDKADFCIIHYAGKVDYKADEWLMKNMDPLND 
NVATLLHQSSDRFVAELWKDVDRIVGLDQVTGMTETAFGSAYKT 
KKGMFRTVGQLYKESLTKLMATLRNTNPNFVRCIIPNHEKRAGK 
LDPHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILTP 
K"AIPKGFMDGKQACERMIRALELDPWLYRIGQSKIFFRAGVLAH 
LEEERDLKI TDIII FFQAVCRGYLARKAFAKKQQQLSALKVLQR 
NCAAYLKLRHWQWWRVFTKVKPLLQVTRQEEELQAKDEELLKVK 
EKQTKVEGELEEMERKHQQLLEEKNItiAEQLQAETELFAEAEEM 
RARLAAKKQELEEILHDLESRVEEEEERNQILQNEKKKMQAHIQ 
DLEEQLDEEEGARQKLQLEKVTAEAKIKKMEEEILLLEDQNSKF 
IKEKKLMEDRIAECSSQLAEEEEKAKNLAKIRNKQEVMISDLEE 
RLKKEEKTRQELEKAKRKLDGETTDLQDQIAELQAQIDELKLQL 
AKKEEELQGALARGDDETLHKNNALKWRELQAQIAELQEDFES 
EKASRNKAEKQKRDLSEELEALKTELEDTLDTTAAQQELRTKRE 



FKANLEKNKQGLETDNKELACEVKA'-LQQVKAESEHKRKKLDAQV 
QELHAKVSEGDRLRVELAEKASKLQNELDNVSTLLEEAEKKGIK 
FAKDAASLESQLQDTQELLQEETRQKLNLSSRIRQLEEEKNSLQ 
EQQEEEEEARKNLEKQVLALQSQLADTKKKVDDDLGTIESLEEA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidina, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








KKKLLKDAEALSCRLEEKALAYDKLEKTKNRLQQELDDLTVDLD 
HQRQVASNLE XKQ \ KKFDOLLAEEKS ISARYAEERDRAEAEARE 
KETKALSLARALEEALEAKEEFERQHKQLRADMEDLMSSKDDVG 
KNVHELEKSKRALEQQV\EEMRTQLEELEDELQATEDAKEjRLEV 
NMQAMKAQFERDLQTRDEQNEEKKRLLIKQVRELEAELEDERKQ 
RALAVAS KKKI1E IDLKDLEAQ I EAANKARDE VrKQLRKLQBQMK 
DYQRELEEARASRDEIFAQSKESEKKLKSLEAEILQLQEELASS 
ERARRHAEQERDELADEITNSASGKSALLDEKRRLEARIAQLEE 
ELEEEQSNMELLNDRFRKTTLQVDTLNAELAAERSAAQKSDNAR 
QQLERQNKELKAKLQELEGAVKSKFKATISALEAKIGQLEEQLE 
QEAKERAAANKLVRRTEKKLKE I FMQVEDERRHADQYKEQMEKA 
NARMKQLKRQLEEAEEEATRANASRRKLQRELDDATEANEGLSR 
EVSTLKNRLRRGGPISFSSSRSGRRQLHLEGASLELSDDDTESK 

tsdvnetqppqse 


5932 


33 


572 


RHLEEICFLFLQKGRKLKLSGPRWEE3KPRGTGGLWVKAEANMG 
FGATLAVGLTIFVLSWTIIICFTCSCCCLYKTCRRPRPV\APP 
PHPP/PWHAPYPQPPSVPPSYPGPSYQGYHTMPPQPGMPAAPY 
PMQYPPPYPAQPMGPPAYHETLAGGAAAPYPASQPPYNPAYMDA 
PKAAL 


5933 


1 


3190 


GTRKLKvjiDI i ;- ( A.s.SKTRfSRDVHSSGSSDAHMDaSGPSD 
SDMPSRTRPKSPRKHNYRNESARESLCDEPHQKLSRPLLENKLK 
AFS IGKMSTAKRTLS KKEQEELKKKEDEKAAAEI YEE FLAAFEG 
SDGNKVKTFVRGGVVNAAKEEHETDEKRGKIYKPSSRFADQKNP 
PKQSSNERPPSLLVIETKKPPLKKGEKEKKKSMLELFKEELKOI 
QEERDERHKTKGRDSRFEPPQSDSDGQRRSMDAPSRRNRSSGVL 
DDYAPGSHDVGDPSTT\NFYLGNI\NPQMNLKKCCCC.EFGRFGP 
LASVKIMWPRTDEERARERNCGFVAFMNRRDAERALKNLNGPGMI 
MSFEMKLGWGKAVPIPPHPIYIPPSMMEHTLPPPPSGLPFNAQP 
REKLluMPNAPMLPPPKNKEDFEKTLSQAIVKWIPTERNLLALI 
HRMIEFWREGPMFEAMIMNREINNPMFRFLFENQTPAHVYYRW 
KLYS ILQGDS PTKWRTEDFRMFKNGS FWRPPPLNPYLHGMSEEQ 
ETEAFVEEPSKKGALKEEQRDKLEEILRGLTPRKNDIGDAMVFC 
LNNAEAAEEIVDCITESLSILKTPLPKKIARLYLVSDVLYNSSA 
KVANASYYRKFFETKLCQIFSDLNATYRTIQGHLQSEWFKQRVM 
rCFRAWEDWAIYPEPFLIKLQNIFLGLVNIIEEKETEDVTDDLD 
GAPIEEELDGAPLEDVDGIPIDATPIDDLDGVPIKSLDDDLDGV 
PLDATEDS KKNE P I FKVAPS KWEAVDESELEAQAVTTSKWELFD 
QHEESEEEENQNQEEESEDEEDTQSSKSEEHKLYSNPIKEEMTE 
SKFSKYSEMSEEKRAKLREIELKVMKFQDELESGKRPKKPGQSF 
QEQVEHYRDKLLQREKEKELERERERDKKDKEKLESRSKDKKEK 
DECTPTRKERKRRHSTSPSPSRSSSGRRVKSPSPKSERSERSER 
SHKESSRSRSSHKDSPRDVSKKAKRSPSGSRTPKRSRRSRSRSP 
KKSGKKSRSQSRSPHRSHKKSKGKTNTGRKFFKKAVTYWKCDLF 
LCPERSVF 


5934 


1 


3190 


GTRKL KMAnKTPGGSQKAS S KTRSSDVHS SGSSDAHMDASGPSD 
SDMPSRTRPKS PRKHN YRNE S AREELCDS PHQNLS RPLLENKLK 
AFSIGKMSTAKRTLS KKEQEELKKKEDEKAAAEI YEEFLAAFEG 
SDGNKVKTFVRGGWNAAKEEHETDEKRGKIYKPSSRFADQKETP 
PNQSSNERPPSLLVIETKKPPLKKGEKEKKKSNLELFKEELKQI 
QEERDERHKTKGRLSRFEPPQSDSDGQRRSMDAPSRRNRSSGVL 
DDYAPGSHDVGDPSTT\NFYLGNl\NPQMNLKKCCCQEFGRFGP 
LASVKIMWPRTDEERARERNCGFVAFMNRRDAERALKNLNGKMI 
MSFEMKLGWGKAVPIPPHPIYIPPSMHEHTLPPPPSGLPFNAQP 
RERLKNPNAPMLPPPKNKEDFEKTLSQAIVKWIPTERNLLALI 
HRMIEFWREGPMFEAMIMNREINNPMFRFLFENQTPAHVYYRW 
KLYSILQGDSPTKWRTEDFRMFKNGSFWRPPPLNPYLHGMSEEQ 
ETEAFVEEPSKKGALKEEQRDKLEEILRGLTPRKNDIGDAMVFC 
LNNAEAAEE I VDC I TES LS I LKTP LP KK I ARL YLVSDVL YNS S A 
KVANASYYRKFFETKLCQIFSDLNATYRTIQGHLQSENFKQRVM 
TCFRAWEDWAIYPEPFLIKLQNIFLGLVNIIEEKETEDVPDDLD 
GAPIEEELDGAPLSDVDGIPIDATPIDDLDGVPIKSLDDDLDGV 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 

sequence 


Predicted end 

location 

to first 
amino acid 
residue of 
amino acid 
sequence 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proliue, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








I FKVAPSKWEAVDESELEAQAVTTS KWELFD 
QHEESEEEENQNQEEESEDEEDTQSSKSEEHHLYSNPIKEEMTE 
SKFSKYSEMS3EKEAKLHEIBLKVMICFQDELESQKRPKKPGQSF 
QEQVEHYRDKLLCREKEKELERERERDKKDKEKLESRSKDKKEK 
DECTPTRKERKRRHSTSPSPSRSSSGRRVKSPSPKSERSERSER 
SHKESSRSRSSHKDSPRDVSKKAKRSPSGSRTPKRSRRSRSRSP 
KKSGKKSRSQSRSPHRSHKKSKGKTNTGRKFFKKAVTYWKCDLF 
LCPERSVF 


5935 




4493 


SYW^SGWRLSRPPRQFWAGWRGIGRFGTMAPVHGDDCEIGASAL 
SDSGS FVS SRARREKKS KKGRQEALERLKKAKAGERYKYEVEDF 
TGVYEEVDEEQYSKLVQARQDDDWIVDDDGIGYVEDGREIFDDD 
LEDDALDADEKGKDGKARNKDKRUVKKLAVTKPNNIKSMFIACA 
GKKTADKP.VDLSKDGLLGDILQDLNTETPQITPPPVMILKKKRS 
IGASPNPFSVHTATAVPSGKIASPVSRKEPPLTPVPLKRAEFAG 
DDVQVESTEEEQESGAMEFEEGDPDEPMEVEEVDLEPMAAKAWD 
KESEPAEEVKQEADSGKGTVSYLGSFLPDVSCWDIDQEGDSSFS 
VQEVQVDSSHLPLVKGADEEQVFHFYWLDAYEDQYNQPGWFLF 
GKWI3SAETHVSCCVMVKNIERTLYFLPREMKIDLNTGKETGT 
PISMKDVYEEFDEKIATKYKIMKFKSXPVEKWYAFEIPDVPEKS 
EYLEVKYSAEMPQLPQDLKGETFSHVFGTNTSSLELFLMNRKIK 
GPCWLEVKKSTALNQPVSWCKVEAMALKPDL\fWIKEVSPPPLV 
VMAFSMKTMQNAKNHQNE I IAMAALVHHSFALDKAAPKPPFQSH 
FCWSKPKDCIFPYAFKEVIEKKNVKVEVAATERTLLGFFLAKV 
HKIDPDIIVGHNIYGFELEVLLQRINVCKAPHWSKIGRLKRSNM 
PKLGGRSGFGERMATCGRMICDVEISAKELIRCKSYHLSELVQQ 
ILKTERWIPMENIQNMYSESSQLLYLLEHTWKDA\KFILQIMC 
ELNVLPLALQITNIAGNIMSRTLMGGRSERNBFLLLHAFYENNY 
IVPDKQIFRKPQQKLGDEDEEIDGDTNKYKKGRKKGAYAGGLVL 
DP KVGF YDKF1 LLLDFNSLYPS IIQE FNICFTTVQRVASEAQKV 
TEDGEQEQIPELPDPSLEMGILPREIRKLVERRKQVKQLMKQQD 
LNPDLILQYDIRQKALKLTANSMYGCLGFSYSRFYAKPLAALVT 
YKGREILMHTKEMyQKMNLEVIYGDTDSIMINTNSTNLEEVFKL 
GNKVKSEVNKLYKLLEIDIDGVFKSLLLLKKKKYAALWEPTSD 
GNYVTKQELKGLDIVRRDWCDLAKDTGNFVIGQILSDQSRDTTV 
ENIQKRLIEIGENVLNGSVPV3QFEINKALTKDPQDYPDKKSLP 
HVHVALW I NS QGGRKVKAGDT VS YV I CQDGSNLTAS QRAYAPEQ 
LQKQDNLTIDTQYYLAQQIHPWARICEPIDGIDAVLIATGWEL 
\DETQFKVHHYHKDKENUAIjLGGPAQLTDEEKYRDCERFKCPCP 
rrGTr.TITYDNVFDGSGTDMEPSLYRCSNIDCKASPLTFTVQLSN 
KLIMDIRRFIKKYYDGWLICEEPTCRNRTRHLPLQFSRTGPLCP 
ACMKATLQPEYSDKSLYTQLCFYRYIFDAECALEKLTTDHEKDK 
LKKQFFTPKVLQDYRKLKNTAEQFLSRSGYSEVNLSKLFAGCAV 
KS 


593S 


1124 


139 


RGEEQFDAEFRRFACLGFGERLQEFSRLLRAVHRSRAWTCYLAI 
RMLMATCCPSPTTTACTGPWQRAPPLRLLVQKREADSSGLAFAS 
NSLQRRKKGLLLRPVAPLRTRPPLLISLPQDFRQVSSVIDVDLL 
PETHRRVRLHKHGSDRPLGFYXRDGMSVRVAPQG\LERVPGIFI 
SRLVRGGLAESTGLLAVSDEILEVNGIEVAGKTLNQVTDMMVAN 
SHN\LIVTVKPANQRNNWRGASGRLTGPPSAGPGPAEPDSDDD 
SSDLVIENRQPPSSNGLSQGPPCWDLHPGCRHPGTRSSLPSLDD 
QEQASSC-WGSRIRGDGSGFSL 


5937 


31 


1600 


P LLQDKRYQCVYSLAE I FKVLAS FYVILVI L 
YGLTSSYSLWWMLRSSLKQYSFEALREKSWYSDIPDVKNDFAFI 
LHLADQYDPLYSKRFS I FLSEVSENKLKQINLNNEWTVEKLKSK 
LVKNAQDKIELHLFMLNGLPDNVFELTEMEVLSLEL1PEVKLPS 
AVSQLVNLKELRVYHSSLWDHPALAFLEENLKILRLKFTEMGK 
IPRWVFHLKNLKELYLSGCVLPEQLSTMQLEGFQDLKNLRTLYL 
KSSLSRIPQWTDLLPSLQKLSLDNEGSKLVVLNKLKKMVNLKS 
LELISCDLERIPHSIFSLNNLHELDLRENWLKTVEEIISFQHLQ 
NLSCLKLWHNNIAYIPAQIGALSNLEQLSLDHNNIENLPLQLFL 
CTKLHYLDLSYNHI,TFIPEEIQYL\SNLQYFAVTNNN1EMLPDG 
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Predicted 
nucleotide 
rre spending 



nucleotide 

location 

corresponding 



Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=?roline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



YKGEGFFCNQ3ARGERRKKKKAMSSPNIWSTGSSVYSTPVFSQK 
MTWILLLLSLYPGFTSQKSDDDYEDYASNKTWVLTPKVPEGDV 
TVILNNLLEGYDNKLRPDIGVKPTLIHTDMYVNSIGPVNAINME 
YTIDIFFAQTWYDRRLKFtfSTIKVLRLNSNMVGKIWIPDTFFRN 
SKKADAHWITTPNRMLRIWNDGRVLYSLRLTIDAKCQT,QLHNFP 
MDEHSCPLEFSSYGYPRESIVYQWKRSSVEVGDTRSWRLYQFSF 
VGLRNTTEWKTTSGDYVVMSVYFDL3RRMGYFTIQTYIPCTLI 
WLSWVSFWINKDAVPARTSLGITTVLTMTTLSTIARKSLPKVS 
YVTAMDLFVSVCFIFVFSALVEYG\TLHYFVSWRKPSKDKDKKK 
KNPAPTIDIRPRSATIQMNNATHLQERDEEYGYECLDGKDCASF 
FCCFEDCRTGAWRHGRIHIRIAKMDSYARIFFPTAFCLFNLVYW 
VSYLYL 



IRPGYLKEVQENSPGHRAGLEPFFDFIVSINGSRLNKDNDTLKD 
LLKANVEKPVKMLIYSSKTLELRETSVTPSNLWGGQGLLGVSIR 
FCSFDGANENVWHVLEVESNSPAALAGLRPHSDYIIGADTVMNE 
SEDLFSLIETHEAKPLKLYVYNTDTDNCREyilTPNSAWGGEGS 
LGCGIGYGYLIIRIPTRPFEEQKKISLPGQMAGTPITPLKDGFTE 
VQLS SVNPPSLSPPGTTGIEQSLTGLS ISSTP \ PAVSS VLS TGV 
PTVP\LLPPQVNQSLTSVPPMESSYLHLPGLMPFTRQGLPNLPQ 
PSTFi\LPR\PTHSWPGVGLYQEFVKPGVLPPLSSt1PPRNLPG\l 
APLPLPSEFLPSFPLVPESSSAASSGELLSSLPPTSNAPSDPAT 
7AKAPTTVEDRVGUSTPVSEKPVSAA 



RRSASRSASPRQSAGTAVTTGTRAGGTCIAAAHHRMRWRADGRS 
LEBXPVHMGLVITEVEQEPSFSDIASLVVWCMAVGISYISVYDH 
3GIFKRNNSRLMDEILKQQQELLGLDCSKYSPEFANSNDKDDQV 
LNCHLAVKVLSPEDGKADIVRAAQDFCQLVAQKQKRPTDLDVDT 

LAWYLVQMWLILI 

MCLGRMGASSPRSPEPVGPPAPGLPFCCGGSLLAWVLLALPVA 
WGQCNAPEW\LPFARPTNLTDEFEFPIGTYLNYECRPGYSGRPF 
S 1 1 CLKNS VWTGAKDRCRRKS CRNPPDPVHGKVHVI KGIQFGSQ 
IKYSCTKGYRLIGSSSATCIISGDTVIWDNETPICDRIPCGLPP 
TITNGDFISTNRENFHYGSWTYRCNPGSGGRKVFELVGEPSIY 
CTSNDDQVGIWSGPAPQCIIPNKCTPPNVENGILVSDNRSLFSL 
NEWEFRCQPGFVMKGPRRVKCQALNKWEPELPSCSRVCQPPPD 
VLHAERTQRDKDMFSPGQEVFYSCEPGYDLRGAASMRCTPQGDW 
PAAPTCEVKSCDDFMGQLLNGRVLFPVNLQLGAKVDFVCDEGF 
QLKGSSASYCVLAGMESLWNSSVPVCEQIFCPSPPVIPNGR1 
KPLEVFPFGKAVNYTCDPHPDRGTSFDLIGESTIRCTSDPQGNG 
VWSSPAPRCG1LGHCQAPDHFLFAKLKTQTNASDFPIGTSLKYE 
CRPEYYGRPFSITCLDNLWSSPKDVCKRKSCKTPPDPVNGMVH 
VITDIQVGSRINYSCTTGHRLIGHSSAECILSGNAAHWSTKPPI 
CQRIPCGLPPTIANGDFISTNRENFHYGSWTYRCNPGSGGRKV 
FELVGEPS IYCTSNDDQVGIWSGPAPQCI IPNKCTPPIdVENGIL 
VSDNRSLFSLNEWEFRCCPGFVMKGPRRVKCQALNKWEPELPS 
CSRVCQPPPDVLHAERTQRDKDNFSPGQEVFYSCEPGYDLRGAA 
SMRCTPQGDWSPAAPTCEVKSCDDFMGQLLNGRVLFPVNLQLGA 
KVDFVCDEGFQLKGSSASYCVLAGMESLWNSSVPVCEQIFCPSP 
PVI PNGRHTGKPLE VFP FGKAVNYTCDPHPDRGTS FDLIGES T I 
RCTSDPQGNGVWSSPAPRCGILGHCQAPDHFLPAKLKTQTNASD 
FPIGTSLKYECRPE YYGRPFS I TCLDNLVWS S PKDVCKRKSCKT 
PPDPVNGMVHVITD I QVGSRINYS CTTGHRL1GHSSAECI LSGN 
TAHWSTKPPICQRIPCGLPPTIANGDFISTHR3NFHYGSWTYE 
CNLGSRGRKVFELVGEPSIYCTSNDDQVGIWSGPAPQCIIPNKC 
TPPNVENGILVSDNRSLFSLNEWEFRCQPGFVMKGPRRVKCQA 
LNKWEPELPSCSRVCQPPPEILHGEHTPSHQDNFSPGQEVFYSC 
EPGYDLRGAASLHCTPQGDWSPEAPRCAVKSCDDFLGQLPHGRV 
LFPLNLQLGAKVSFVCDEGFRLKGSSVSHCVLVGMRSLWNNSVP 
VCEHIFCPIJPPAILHGRHTGTPSGDIPYGKEISYTCDPHPDRGM 
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Glutamic Acid, F=Phenylalanine, G=Glycine, 
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S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion! 








TFHLIGESTIRCTSDPHGNGVWSSPAPRCELSVRAGHCKTPEQF 
PFASPTIPINDFEFPVGTSLNYECRPGYFGKMFSISCLENLVWS 
SVEDNCRRKSCGPPPEPFNGMVHINTDTQFGSTVNYSCNEGFRL 
IGSPSTTCLVSGNNVTWDKKAPICEIISCEPPPTISNGDFYSNN 
RTS FHNGT WTYQCHTGPDGEQLFELVGERS IYCTS KDDQVGVW 
SSPPPRCISTNKCTAPEVENAIRVPGNRSFFSLTEIIRFRCQPG 
FVMVGSHTVQCQTNGRWGPKLPHCSRVCQPPPEILHGEHTLSHQ 
DNFSPGQEVFYSCEPSYDLRGAASLHCTPQGDWSPEAPRCTVKS 
CDDFLGQLPHGRVLLPLNLQLGAKVSFVCDEGFRLKGRSASHCV 
LAGMKALWNSSVPVCEQIFCPNPPAILNGRHTGTPLGDIPYGKE 
VSYTCDPHPDRGMTFNLIGESTIRRTSEPHGNGVWSSPAPRCEL 
PVGAACPHPPKIQNGHYIGGHVSLYLPGMTISYTCDFGYLLVGK 
SFIFCTDQGIWSQLDHYCKEVNCSFPLFMNGISKELEMKKVYHY 
GDYVTLKCEDGYTLEGSPWSQCQADDRWDPPLAKCTSRTHDALI 
VGTLSGTIFFILLIIFLSWIILKKRKGNNAHENPKEVAIHLHSQ 
GGESVHPRTLQTNEENSRVLP 




4509 


S88 


YLYVRMRANPLAYGISHKAYQIDPPL\RKHREQ\LVIE\VGRKL 
DK\AQMIRFEERTGYFSSTDLGRTASHYYIKYNTIETFNELFDA 
HKTEGDIFAIVSKAEEFDQIKVREEEIEELDTLLSNFCELSTPG 
GVENSYGKINILLQTYINP.GEMDSFSLISDSAYVAQNAARIVRA 
LFEIALRKRWPTMTYRLLMLSKAIDKRLWGWASPLRQFSILPPH 
MLTRLEEKKLTVDKLKDMRKD3IGHILHHVWIGLKVKQCVHQIP 
SVMMEAFIQPITRTVLRVTLSIYADFTWNDQYHGTVGEPWWIWV 
EDPTNDHIYHSEYFLALKKQVISKEAQLL.VFTIPIFEPLPSQYY 
IRAVSDRWLGAEAVCIINFQHLIT.PERHPPHTELLDLQPLPITA 
IGCKAYEALYKFSHFNPVQTQIFHTLYHTDCNVLLGAPTGSGXT 
VAAELAI FRVFNK YPTS KAV Y I APLKALVRERMDDWKVR IEEKL 
GKKVIELTGDVTPDMKSIAKADLIVTTPEKWEGVSRSWQNRNYV 
QQVTILIIDEIHLLGEERGPVLEVIVSRTNFISSHTEKPVRIVG 
LSTALANARDLADWLNIKQMGLFNFRPSVRPVPLEVHIQGFPGQ 
HYCPRMASMNKPAFQAIRSHS PAKPVL I FVS SRRQTRLTALEL I 
AFLATEEDPKQWLNMDEREMENIIATVRDSNLKLTLAFGIGMHH 
AGLHERDRKTVEELFVNCKVQVLIATSTLAWGVWFPAHLVIIKG 
7EYYDGKTRRYVDFPITDVLQMMGRAGRPQFDDQGKAVILVHDI 
KKDFYKKFLYEPFPVESSLLGVLSDHLNAEIAGGTITSKQDALD 
YITWTYFFRRLIMNPSYYNLGDVSKDEVKKFLSHLIEKSLIELE 
LSYCI3IGEDNRSIEPLTYGRIASYYYLKHQTVKMFKDRLKPEC 
ST3ELLSILSDAEEYTDLPVRHNEDHKNSEIAKCLPIESKPHSF 
DSPHTKAHLLLOAHLSRAMLPCPDYDTDTKTVLDQALRVCQAML 
DVAANQGWLVTVLNITNLIQMVIQGRWLKDSSLLTLPNIENHHL 
HLFECKWKPIMKGPHARGRTSIECLPELIHACGGKDHVFSSMVES 
ELHAAKTKQAWNFLSHLPEINVGISVKGSWDDLVEGHNELSVST 
LTADKRDDNKWIKLHADQEYVLQVSLQRVHFGFHKGKPESCAVT 
PRFPKSKDEGWFLILGEVDKRELIALKRVGYIRNHHVASLSFYT 
PEIPGRYIYTLYFMSDCYLGLDQQYD/NLSQRYXSESFCTGQHQ 
GL 


5943 


1 


2274 


' Tl HKTYLSSSKAKMAAAEGPVGDGELWQTWLPHHWFLRLR 
3GLKWQSPTEAEKPASSSLPSSPPPQLLTRNWFGLGGELFLWD 
C-EDSSFLWRLRGPSGGG\EEPALSQYQRLLCINPPLFE1YQVL 
LSPTQHHVALIGIKGLMVLELPKRWGKNSEFEGGKSTVNCSTTP 
VAERFFTSSTSLTLKHAAWYPSEILDPHWLLTSDNVIRIYSLR 
HPQTPTNVIILSEAEEESLVLKKGRAYTASLGETAVAFDFGPLA 
AVPKTLFGQNGKDEWAYPLYILYENGETFLTYISLLHSPGN/I 
KKAVGSIAHAS \AAEDNYGYDACAVLCL-PCVPNILiVIATESGML 
YHCWLEGEEEDDHTSEKSWDSRIDLIPSLYVFECVELELALKL 
ASGEDDPFDSDFSCPVKLHRDPKCPSRYHCTHEAGVHSVGLTWI 
HJ ! T . FLGSDEEDKDSLQELSTEQKCFVEKILCTKPLPCRQPAP 
IRGFWIVPDILGPTMICITSTYECLIWPLLSTVHPASPPLLCTR 
EDVEVAESPLRVLAETPDSFEKHIRSILQRSVANPAFLKASEKD 
IAPPP3ECLQLLSRATQVFREQYILKQDLAKEEIQRRVKLLCDQ 
KKKQLEDLSYCREERKSLREMAERLADKYEEAKEKQEDIMNRMK 
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< I i-HSFHSELPVLSDSERDMKKELQLIPDQLRHLGNAIKQVTMK 
KDYQQQKMEKVLSLPKPT1ILSAYQRKCIQS1LKEEGEHIREMV 
KQINDIRNHVNF 


5944 


157 




FSIATFTDEPEVLTEPPSATTTTTIGISATWTTLAGSHGKRNNT 
ITTTSSKRKNRKNKITPENVQ1IFDDPLPISYSQPEKVNGESKS 
SSTSESGDSDNMRISSCSDESSNSNSSRKSDNHSPAWTTTVSS 
KKQPSVLVTFPKEERKSVSGKASIKLSETISEGTSNSLSTCTKS 
C-PSPLS S PNGKLTVAS PKRGQKREEGWKE WRRSKKVSVPS TVI 
SRVIGRGGCNINAIREFTGAHIDIDKQKDKTGDRIITIRGGTES 
TRQATQLINALIKDPDKEIDELIPKNRLKSSSANSKIGSSAPTT 
TAANTSLMGIKMTTVALSSTSQTATALTVPAISSASTHKTIKNP 
VN\N\ r RPGFPVSFP\LAYPPPQFAHALLAAQTFQQIRPPRLPMT 
HFGGTFPPAQSTWGPFPVRPLSPARATNSPKPHMVPRHSNQIJSS 
GSQVNSAGSLTSSPTTTTSSSASTVPGTSTMGSPSSPSVRRQLF 
VTWKTSNATTTTVTTTASNWNTAPTNATYPMPTAKEHYPVSSP 
SSPSPPAQPGGVSRNSPLDCGSASPNKVASSSEQEAGSEPWET 
TNTRPPNSSSSSGSSSAHSNQQQPPGSVSQEPRPPLQQSQVPPP 
EVRMTVPPLATSSAPVAVPSTAPVTYPMPQTPMGCPQPTPKMET 
PAIRPPPHGTTAPHRNSASVQNSSVAVLSVNHIKRPHSVPSSVQ 
LPSTLSTQSACQNSVHPANKPIAPNFSAPLPFGPFSTLFENSPT 
SAHAFWGGSWSSQSTPESMLSGKSSYLPNSDPLHQSDTSKAPG 
FRPPLQRPAPSPSGIVNMDSPYGSVTPSSTHLGNFASNISGGQM 
YGPGAPLGGAPAAANFNRQHFSPLSLLTPCSSASNDSSAQSVSS 
GVRAPSPAPSSVPLGSEKPSNVSQDRKVPVPIQTERSARIRQTG 
TSAPSVIGSNLSTSVGHSGIWSFEGIGGNQDKVDWCNP.J C I 1 
IHRPMSDPGVFSQHQAMERDSTGIVTPSGTFHQHVPAGYMDFPK 
VGGMPFSVYGNAMIPPVAPIPDGAGGPIFNGPHAADPSWNSLIK 
MVSSSTENNGPQTVWTGPWAPHMNSVHMNQLG 


5945 


1461 


197 


GVTHLFLFGKRKLRNGIAEDLKGQADFFFLLVSEAWATGSPRA 
WLTCLILPLPGIIFSVLPKAMSRPLLITFTPATDPSDLWKDGQQ 
QPQPEKPESTLDGAAARAFYEALIGDESSAPDSQRSQTEPARER 
KRKKRRIMKAPAAEAVAEGASGRHGQGRSLEAEDKMTHRILRAA 
QEGDLPELRRLLEPHEAGGAGGNINARDAFWWTPLMCAARAGQG 
AAVSYLLGRGAAWVGVCELSGRDAAQLAEEAGFPEVARMVRESH 
GETRSPENRSPTPSLQYCENCDTHFQDSNHRTSTAHLLSIjSQGP 
QPPNLPLGVPISSPGFKLLLRGGWEPGMGLGPRGEGRANPIPTV 
LKRDQEGLGYRSAPQPRVTHFPAWDTRAVAGRE\TPPRVATLSW 
REERRREE \ KDRAWERDLRTYMNLEF 


5946 


541 


1S66 


I_GSYSSIQPEEYS\SWC\EWLQDLLA\YVSPK\HSYLRDLP 
SEGSPaRVUSIDFV\EL\EHLQPDVLVHAVLRWDF/TILTEAV 
YSYRG2KQKKVMLTVEQAQDQHYALVLWGPGAAW\YPQLQRKKG 
YIWEFKYLFVQCNYTLENLELHTTPWSSCECLFDDDIRAITFKA 
KFQKSAPS FVKI SDLATHLED KCSG WL I KAQ I SELAFP I TASQ 
KIALWAHSSLKSIFSSLPNIVYTGCAKCGLELETDENRIYKQCF 
ECLPFTMKKIYYRPALMTAIDGRHDVCIRVESKLIEKILLNISA 
DCLMRVIVPSSEITYGMWADLFHSLLAVSAEPCVLKIQSLFVL 
DENSYPLQQDFSLLDFYPDIVKHGANARL 


5947 


3 


1317 


RG.PDRRRRGPIGRVNMDLENKVKKMGLGHEQGFGAPCLKCKEK 
CEGFELHFWRKICRNC\NVAKKSM/TVIjLSNEEDRKVGKLFEDT 
KYTTL I AKLKSDGI PMYKRNVMI LTNPVAAKKNVS INTVTYEWA 
PPVQNQALARQYMQMLPKEKQPVAGSEGAQYRKKQLAKQLPAHD 
QDPSKCHELSPREVKEMEQFVKKYKSEALGVGDVKLPCEMDAQG 
PI . 'nIPGGDRSTPAAVGAMEDKSAEHKRTQYSCYCCKLSMKEG 
DPAIYAERAGYDKLWKPACFVCSTCHELLVDMIYFWKNEKLYCG 
RHYCDSEKPRCAGCDELIFSNEYTQAENQNWHLKHFCCFDCDSI 
LAGEIYVMWDKPVCKPCYVKNHAWCOGCHNAIDPEVQRVTYN 
NFS>;:iASTECFLC3CCSKCLIGQKFMPVEGMVFCSVECKKRMS 


5948 


39 




iUb ' L \ ^SALEVCWDFLSGLTEGSLLPEGFFSGPIDQ 
GNHYQMRRKGRCHRGSAARHPSSPCSVKHSPTRETLTYAQAQRM 
E !^_EGRLHRISIFDPLEIILEDDLTAQEMSECNSNKENSERP 
PVCLRTXRHKNNRVKKKNEALPSAHGTPASASALPEPKVR1VEY 
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S?-SAPRRPPVYYKFiEK5AEELDNEVEYDMDEEDYAWLEIVNE 
KRKGDCVPAVSQSMFEFLMDRFEKESHCENQKQGEQQSLIDEDA 
VCCICMDGECQNSNVILFCDMCNLAVHQECYGVPYIPEGQWLC/ 
RAHCLQSRARPADCVLCPNKGGAFKKTDDDRWGHV\VCALW\IP 
E\VGFANTVFIEPIDGVRNIPPARWKLT\CNLCKEKGR/VGACI 
QCHKANCYTAFHVTCAQKAGLYMKMEPVKELTGGGTTFSVRKTA 
YCDVHTP PGCTRRPLN I YGDVEMKNGVCRKESSVKTVRS TSKVR 
KKAKKAKKALAEPCAVLPTVCAPYIPPQRLMRIANQVAIQRKKQ 
FVERAHSYWLLKRLSRNGAPLLRRLQSSLQSQRSSQQRENDEEM 
KAAKEKLKYWQRLRHDLERARLLIELLRKREKLKREQVKVEQVA 
MELRLTPLTVLLRSVLDQLQDKDPARIFAQPVSLKEVPDYLDHI 
KHPMDFATMRKRLEAQGYKNLHEFEEDFDLIIDNCMKYNARDTV 
FYRAAVRLRDQGGWLRQARREVDSIGLEEASGMHLPERPAAAP 
RRPFSWEDVDRLLDPANRAHLGLEEQLRELLDMLDLTCAMKSSG 
SRSKRAKLLKKEIALLRNKLSQQHSQPLPTGPGLEGFEEDGAAL 
GPEAGEEVLPRLETLLQPRKRSRSTCGDSEVEEESPGKRLDAGL 
TNGFGGARSEQEPGGGLGRKATPRRRCASESSIS3SNSPLCDSS 
FNAPKCGRGKPALVRRHTLEDRS EL I S CIENGNYAKAAR IAAEV 
CQSSMWISTDAAASVLEPLKVVWAKCSGYPSYPALIIDPKMPRV 
PGEHNGVTIPAPPLDVLKIGEHMOTKSDEKLFLVLFFDNKRSWQ 
HLPKSKMVPLGIDETIDKLKMMEGP.NSSIRKAVRIAFDRAMNHL 
SRVHGEPTSDLSDID 


5949 


39 


3370 


YRERYPVSGGSVLRSALEVCWDFLSGLTEGSLLPEGFFSGPIDQ 
C^KYyMraKGKCHRGSAARHPSSPCSVKHSPTRETLTYAQAQRM 
VE I E I EGRLHR I S I FDPLE I I LECDL 2EMSSCNSNKEI IS 
PVCLRTKRHKNNRVKKKNEALPSAHGTPASASALPEPKVRIV3Y 
SPPSAPRRPPVYYKFIEKSAEELDNEVEYDMDEEDYAWLEIVNE 
KRKGDCVPAVSQSMFEFLMDRFEKESHCENQKQGEQQSL I DEDA 
VCCICMDGECQNSNVILFCDMCNLAVHQECYGVPYIPEGQWLC/ 
RAHCLQSRARPADCVLCPNKGGAFKKTDDDRWGHV\VCALW\IP 
E \VGFANTVFIEP I DGVRK I PPARWKLT \ CNLCKEKGR/VGAC I 
QCHKANCYTAFHVTCAQKAGLYMKMEPVK3LTGGGTTFSVRKTA 
YCDVHTPPGCTRRPLNIYGDVEMKNGVCRKESSVKTVRSTSKVR 
KKAKKAKKALAEPCAVLPTVCAPYIPPQRLNRIANQVAIQRKKQ 
FVERAHSYWLLKRLSRNGAPLLRRLQSSLQSQRSSQQRENDEEM 
KAAKEKLKYWQRLRHDLERARLLIELLRKREKLKREQVKVEQVA 
MELRLTPLTVLLRSVLDQLQDKDPAR I FAQPVSLKEVPDYLDHI 
t 4 r ~ FATf 1RKRLEAQG YKNLHE FEEDFDLI IDNCMKYNARDTV 
FYRAAVRLRDQGGVVLRQARREVDSIGLEEASGMHLPERPAAAP 
RRPFSWEDVDRLLDPANRAHLGLE3QLRELLDMLDLTCAMKSSG 
SRSKRAKLLKKEIALLRNKLSQQHSQPLPTGPGLEGFEEDGAAL 
GPEAGEEVLPRLETLLQPRKRSRSTCGDSEVEEESPGKRLDAGL 
TNGFGGARSEQEPGGGLGRKATPRRRCASESSISSSNSPLCDSS 
FNAPKCGRGKPALVRRHTLEDRSELISCIENGNYAKAARIAAEV 
WISTDAAASVLEPLKVVWAKCSGYPSYPALIIDPKMPRV 
r 3 WTIPAPPLDVLKIGEKMQTKSDEKLFLVLFFDNKRSWQ 
WLPKSKMVPLGIDETIDKLKMMEGRNSS1RKAVRIAFDRAMNHL 
SRVHGEPTSDLSDID 








SRSLT 1STSQPGACPCQGAASRPAILYALLSSSLKAVPRPRSR ~ 
CLCRQHRPVQLCAPHRTCREALDVLAKTVAFLRNLPSFWQLPPQ 
DQRRLLQGCWGPLFLLGLAQDAVTFEVAEAPVPSILKKILLEEP 
£SSGGSGQLPDRPQPSLAAVQWLQCCLESFWSLELSPKE\YACL 
KGPILFMPDVPGLQAASHIGHLQQEAHWVLCEVLEPWCPAAQGR 
LTRVLLTASTLKSIPTSLLGDLFFRPIIGDVDIAGLLGDMLLLR 


5951 




5449 


WNVKPSLLWQLFKFSDKEEHEQNDS I SGKTGETGVEEMIATRK 
VEQDSK3TVKLSHEDDHILEDAGSSDISSDAACTWPNKTEI1SLV 
GLPSCVOEVTECNLELKDTMGIADKTENTLERHKIEPLGYCEDA 
E.SMP.C'LESIEFWKSNLEWDTSTFGPESNILEH'AICDVPDQNSK 
QLNAIES TKIESHETANLQDDRMSQS S S VSYLESKSVKSKHTKP 
VIHSKQWMTTDAPKKIVAAKYEVIHSKTKVNVKSVKRNTDVPES 
QQNFHRPVKVRKKQIDKEPKIQSCNSGVKSVKNQAHSVLKKTLQ 
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H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosins, X=Unknown, ,*=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DQTLVQIFKPLTHSLSDKSHAHPGCLKEPHHPAQTGHVSHSSQK 
QCHKPQQQAPAMKTNSHVKEELEHPGVEHFKEEDKLKLKKPEKN 
LQPRQRRSSKSFSLDEPPLFIPDNIATIRREGSDHSSSFESKYM 
WTPSKQCGFCKKPHGNRFMVGCGRCDDWFHGDCVGLSLSQAQOM 
GEEDKEYVCVKCCAEEDKKTEILDPDTLENQATVEFHSGDKTME 
CEKLGLSKHTTNDRTKYIDDTVKHKVKILKRESC-EGRNSSDCRD 
NEIKKWQLAPLRKMGQPVLPRRSSEEKSEKIPKESTTVTCTGEK 
ASKPGTHEKQEMKKKKV\EKGVLNVHPAASA3KPSADQIRQSVR 
HSLKDILMKRLTDSNLKVPEEKAAKVATKIEKELFSFFRDTDAK 
YKHKYRSLMFNLKDPKMNILFKKVLKGEVTPDHLIRMSPEELAS 
KELAAWRRRENRHTIEMIEKEQREVERRPITKITHKGEIEIESD 
APMKE2EAAMEIQEPAANKSLEKPEGSEK\RKEEVDSMSKDTTS 
QHRQHLFDLNCKICIGRMAPPVDDLSPKKVKVWGVARKHSDNE 
AESIADALSSTSNILASEFFEEEKQESPKSTFSPAPRPEMPGTV 
EVESTFLARLNFIWKGFINMPSVAKFVTKAYPVSGSPEYLTEDL 
PDS IQVGGR ISPQTVWDYVEKIKASGTKE I CWRFTPVTEEDQI 
SYTLLFAYFSSRKRYGVAANKMKQVKDMYLIPLGATDKIPHPLV 
PFDGPGLELHRPNLLLGLIIRQKLKRQHSACASTSHIAETPESA 
PPIALPPDKKSKIEVSTE3APEEENDFFNSFTTVLHKQRNKPQQ 
NLQEDLPTAVEPLMEVTKQEPPKPLRFLPGVLIGWENQPTTLEL 
ANKPLPVDDILQSLLGTTGQVYDQ\AQSVMEQNTVKEIPFLNEQ 
TNSKIEKTDNVEVTDGENKEIKVKVDNISi;STDKSAEIETSWG 
SSSISAGSLTSLSLRGKPPDVSTEAFLTNLSIQSKQEETVESKE 
KTLKRQLQEDQENNLQDNQTSNSSPCRSNVGKGNIDGNVSCSEN 
DPRQAAGRSQPVTTSESKDGDSCRNGEK 
HMLPGLSHNKEHLTEQINVEEKLCSAEKNSCVQQSDNLKVAQNS 
PSVENIQTSQAEQAKPLQEDILMQNIETVHPFRR3SAVATSHFE 
VGNTCPSEFPSKSITFTSRSTSPRTSTNFSPMRP2QPNLQHLKS 
SPPGFPFPGPPNFPPQSMFGFPPHLPPPLLPPPGFG\FA\QNPM 
VPWPPW\HLP\GQPQRMMGPLSQASRYIGPQHFYQVKDIRRPE 
RRHSDPWGRQDQQQLDRPFNRGKGDRQRFYSDSHHLKRERHEKE 
WEQESERHRRRDRSQDKDRDRKSREEGHKDKERARLSHGDRGTD 
GKASRDSRNVDKKPDKPKSEDYEKDKEREKSKHREGEKDRDRYH 


5952 


322b 


S3 9 


pparrsardlpralsmeaarpsgswngalcrliAlvtl\aflif 
asdacknvtlhvpskldaeklvgrvklkecftaanlihssdpdf 
qil3dgsvyttntillssekrsftillsntenqekkkifvfleh 
2tkvlkkrhtkekvlrrakrrwapipcsmlenslgpfplflqqv 
□ ■-rnaqt-jytiyysirgpgvdqeprnlfyverdtgnlyctrpvdr 
eqyesfeiiafattpdgytpelplpliikiedendnyp1fteet 
ytftifencrvgttvgqvcatdkdepdtmhtrlkysiigqvpps 
ptlfsmhpttgvitttssqldrelidkyqlkikvqdmdgqyfgl 
qttstciiniddvndhlptftrtsyvtsveentvdveilrvtve 
dkelvntanwranytilkgnengnfkivtdaktwegvlcwkpl 
nyeekqqmilqigwneapfsreasprsamstatvtvnvedqde 
gpecnppiqtvrmkenaevgttsngykaydpetrsssgirykkl 
tdptgwvt i dentgs ikvfrs ldreaeti knc- 1 ynitvlasdqg 
c-rtctgtlgiilqdvndnspfipkktviickptmssaeivavdp 
depihgppfdfslesstsevqrmwrlkaindtaarlsyqndppf 
gsywpitvrdrlgmssvtsldvtlcdcitendcthrvdprigg 
ggvqlgkwailaillgialffcilftlvcgasgtskqpkvipdd 
laqqnlivsnteapgddkvysangfttqtvgasaqgvcgtvgsg 
iknggqetiemvkgghqtsescrgaghhhtldscrgghtevdwc 
rytysewhsftqprlgees irghtlikn 


5953 




Bll 


PLLCNPDPGWYWWVKQESE I SKESQEMDARPKLDLGFKEGQT I K 
LCIGNITNKKGGASKPRTARGGGLSLLPPPPGGKVTIPPPSS/V 
KLPSTNHVTPPSIPKSNHGGSDADILLDLDSPAPVTTPAPTPVS 
VSNDLWGDFSTASSSVPNQAPQPSNWVQF 


5954 


32 


2130 


PPPPPPKLAWMADLEAVLADVSYLMAMEKSKATPAARASKRIVL 
PEPSIRSVMQKYLAERNEITFDKIFNQKIGFLLFKDFCLNEINE 
AVPQVKFYEEIKEYEKLDNEEDRLCRSRQIYDAYIMKELLSCSH | 
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SEQ 
ID 


Predicted 

nucleotide 
location 
corresponding 
to first 
amino acid 

amino acid 
sequence 


Pr -dieted end 

location e 
corresponding 

amino acid 
residue of 
amino acid 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartlc Acid, E= 
Glutamic Acid, F=p:-ienylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion! 








PFSKQAVSHVQSHLSKKQVTSTLFQPYIEEICESLRGDIFQKFM 
ESDKFTRFCQWKNVELNIHLTMNEFSVHRIIGRGGFGEVYGCRK 
ADTGKMYAMKCLMKKRIKI1KQGETIALNERIMLSLVSTGDCPFI 
VCMTYAFHTPDKLCFILDLMNGGDLHYHLSQHGVFSEKEMRFYA 
TE 1 1 LGLEHMHNRFWYRDLKPANILLDEHGHARIS \DLGLACD 
FSKKKPHASVGTH3YMAPEVLQKGTAYDSSADWFSLGCMLFKLL 
RGHS P FRCHKTKDKHE IDRMTLTVNVELFDTFS PELKSLLEGLL 
QRDVSKRLGCHGGGSQEVKEHSFFKGVDWQHVYLQKYPPPLIPP 
RGEWAAEAFDIGSFDEEDTKGIKLLDCDQELYKNFPLVISERW 
QQEVTETVYEAVNADTDKIEARKRAKNKCLGHEEDYALGKDCIM 
HGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGE3RQNLLTMEQ 
1LSVEETQIKDKKCILFRIKGGKQFVLQCESDPEFVCWKKELNE 
TFKEAQRLLRRAPKFLNKPRSGTVELPKPSLCHRNSNGL 


5955 


172S 


44-1 


KREREFRLAVCPLRYPSAYESSPGTELRECGLCRSGQEFADCRR "" 

PANRQDVLSGWINLPVLQLTKDPLKTPGRLDHGTRTAFIHHREQ 

VWKRCINIWRDVGLFGVLNEIANSEEEVFEWVKTASGWALALCR 

WAS S LHGSLFPHLSLRSEDLI AE FAQVTNWS S CCLRVFAWHPHT 

NKFAVALLDDSVRVYNASSTIVPSLKHRLQRNVASLAWKPLSAS 

VLAVACQSCILIWTtDPTSLSTRPSSGCAQVLSHPGHTPVTSLA 

WAPSGGRLLSASPVDAAIRVWDVSTETCVPLPWFRGGGVTWLLW 

SPDGSKILATTPSAVFRVWEAQMKTCERWPTLSGRCQTGCWSPD 

GSRLLFTVLGEPLIYSLSFPERCGEGKG\AIjEVQSQQRLWQICL 

RCOYRHOMVRRGLGERLTPWSGTPVGNVWLCL 


SS5S 


1705 


139 


GVGVRGARAMATVQEKAAALNLSALHS PAHRE PG FSVAQKP FGA 
TYVWS S I INTIjQTQVEVKKRRHRI.KRHNDCFVGSEAVDVI FSHL 
IQNKYFGDVDIPRAKWRVCQALMDYKVFEAVPTKVFGKDKKPT 
FEBSSCSLYRFTTIPNQDSQLGKENKLYSPARYADALFKSSDIR 
SASLEDLWENLSLKPANSPHVNISATLSPQVINEVWQEETIGRL 
LQLVDLPLLDSLLKQQEAVPKIPQPKRQSTMVWSSNYLDRGILK 
AYSDSQEDEWLSAAIDCSEYLPDQMWEISRSFPEQPDRTDLVK 
ELLFDAIGRYYSSREPLLNHLSDVHNGIAELLVNGKTEIALEAT 
QLLLKLLDFQNREEFRRLLYFMAVAAMPSEFKLQKESDNRMWK 
R I FSKAI VDNKNLSKGKTDLLVLFIAMDHQKDVFKI pgtl \hki 
VS\VK\LM.ZiIQNGRDPNRDAGYIYCQRIDQREYSNNTEKTTKDE 
LLNLLKTLDEDSKLSAKEKKK\LLGQFYKCHPDIFIEHFGD 


5957 


1479 


451 


ELQVAVAMDTLDRWKPKTKRAKRFLEKREPKLNENIKNAMLIK 
GGNANATVTKVLKDVYALKKPYGVIjYKKKNITRPFEDQTSLEFF 
SKKSDCSLFMFGSHWKttRPNKLVIGRMYDYHVLDMIELGIENFV 
SLKDIKNSKCPEGTKPMLIFAGDDFDVTEDYRRLKSLLIDFFRG 
PTVSNIRLAGLEYVLHFTALNGKIYFRSYKLLLKKSGCRTPRIE 
LEEMGPSLDLVLRRTHIASDDLYKXSMKMPKALKPKKKKNISHD 
TFGTTYGR IHMQKQDLS KLQTRKM\KGLKKRPAER I T3DHEKKS 
KRIKKKLMELSQPLLFHCVLLKRI IKHQS IQSFL 


5958 


1 


3138 


AAALGMIiLWFPACQAFNLDVEKLTVYSGPKGSYFGYAVDFHIPD 
ARTASVLVGAPKANTSQPDIVEGGAVYYCPWPAEGSAQCRQ1PF 
DTTNNRKIRVNGTKEPIEFKSNQWFG\ATVKA\HKGKSCGPVAP 
LLFTWRNFLKPTPEKGPVGTCYVAIQNFSAYAEFSPCGNSNADP 
EGQ3YCQAGFSLDFYKNGDLIVGGPGSFYWQGQVITASVADIIA 
NY3 FKE ILRKLAGEKQTEVAPASYDDSYLGYSVAAGEFTGDSQQ 
ELVAG I PRGAQNFGYVS I INS YDMTFIQNFTGEQMAS YFGYTW 
VSDVNSDGLDDVLVGAPLFMEREFESNPREVGQIYLYLQVSSLIi 
FRDPGILTGTETFGRFGSAMAHLGDLNQDGYNDIAIGVPFAGKD 
QRGKVLIYNGHKDGLNTKPFPKFCQGVWASHAVPEGFGFTLRGD 
SDIDKNDYPDLIVGAFGTGKVAVYSARPWTVDAQLLLHPMIIN 
LENKTCQVPDSMTSAACFSLRVCASVTGQS IANTIVLMAEVQLD 
SLKQKGAIKRTLFLDNHQAHRVFPLVIKRQKSHQCQDFIVYLRD 
ETEFRDKLSPINISLNYSLDESTFKEGLEVKPILNYYRENIVSE 
QAHILVDCGEDNLCVPDLKLSARPDKHQVI IG3ENHLMLI INAR 
NEGEGAYEAELFVKIPEEADYVGIERKNKGFRPLSCEYKMENVT 
RMWCDLGHPMVSGTNYSLGLRFAVPRLEKTNMSIKFDLQIRSS 
NKDNPDSNFVSLQINITAVAQVEIRGVSHPPQIVLPIHNWEPEE 
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corresponding 
) first 



icleotide 
jrresponding 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Gliitamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 
EPHKEEEVGPLVEHIYELHNIGPSTISDTIL3VGWPFSARDEFL 
LYIFHIQTLGPLCCQPNPNINPQDIKPAASP3DTPELSAFLRNS 
TIPHLVRIOIDVHWEFHRQSPAKILNCTNIECLQISCAVGRLEG 
GESAVLKVRSRLWAHTFLQRKNDPYALASLVSFEVKKMPYTEQP 
AKLPEGS I AI KTSVI WATPNVS FS I PLWVI I LAILLGLLVLAIL 

TLALWKCGFFDRARPPQE3MTDREQLTNDKTPEA 

GTSGYAAQQLPSLLKEREFHLGTLNKVFASQNLNHRQWCGTKC 
NTLFWDVQTSQITKIPILKDREPGGVTQQGCGIHAIELNPSRT 
LIATG3DNPNSLAIYRLPTLEPVCVGDDGHKDWIFSIAWISDTM 
AVSGSRDGSMGLWEVTDDVLTKSDARHN\'SRVPVYAHITHKALK 
DIPKEDTNPDNCKVRALAFNNKNKELSAVSLDGYFHLWKAENTL 
SKLLSTKLPYCRENVCLAYGSEWSVYAVGSQAHVSFLDPRQPSY 
NVKSVCSRERGSGIRSVSFYEHIITVGTGQGSLLFYDIRAQRFL 
EERLSACYGSKPRLAGENLKLTTG\KGWLNHDETWRNYFSDIDF 
FPHAVYTHCYDSSGTKLFVAGGPLPSGLHGNYAGLWS 



i 51 m i dgtanratnpt,nke — 
ldviasingfceqlnedfegpplatrllahkiqspqeweaiqalt 
vletcmkscgkrfhdevgkfrflnelikwspkylgsrtsekvk 
nkilellyswtvglpeevkiaeayqmlkkqg\ivksdpklpddt 
tfplppprpkotifedeekskmlarllksshpedlraanklike 
mvqedqkrmekiskrvnaieevnnnvklltemvmshsqggaaag 
ssediakkel\yqrcermrptlfptgsvdtedndSealaeilqr 
ndnltqvinlykqlvrgeevngdatags i pgstsalldlsgldl 
ppagttypamptrpgeqaspeqpsasvsllddelmslglsdptp 
psg 1 — il jw is1 . i sdateppapaiiaqapsmesrppaqtslp 
assglddldllgktllqqslppesqqvrwekqqptprltlrdlq 
nkssscsspsssatsllhtvspepprppc.qpvptelslasitvp 
lesikpsnilpvtvydqhgfrilfhfareplpgrsdvlwwsm 
lstapqpirnivfqsavpkvmkvklqppsgtelpafnpivhpsa 
itqvlllanpqkekvrlrykltftmgdqtynemgdvdqfpppet 

WGSL 



SGBPRPEPGNMATCIGEKIEDFKVGNLLGKGSFAGVYRAESIHT 
GLBVAIKMIDKKAMYKAGMVQRVQNEVKIHCQLKHPSILELYNY 
FEDSNYVYLVLEMCHNGEMNRYLKNRVKPFSENEARHFMHQIIT 
GMLYLHSHGILHRDLTLSNLLLTRNMNIKIADFGLATQLKMPHE 
KHYTLCGTPNYISPEIATRSAHGLESDVWSLGCMFYTLLIGRPP 
FDTDTVKNTLNKWLADYEMPTFLS IEAKDL IHQLLRRNPADRL 
SLSSVLDHPFMSRNSSTKSKDLGTVEDSIDSGHATTSTAITASS 
STSISGSLFDKRRLLIGQPLPNKMTVFPKNKSSTDFSSSGDGNS 
FYTQWGNQETSNSGRGRVIQDAEERPHSRYLRRAYSSDRSGTSN 



NFFKEKTSSSSGSFERPDNNQALSNHLCPGKTPFPFADPTPQTE 
TVCQWFGNLQ INAHLRKTTE YDS ISPNRDFQGHPDLQKDTS KNA 
WTETKVKKNSDASDNAHSVKQQNTMKYMTALHSKPEIIQQECVF 
GSDPLSEQSKTRGMEPPWGYQNRTLRSITSPLVAHRLKPIRQKT 
KKAWS I LDSEEVCVELVKE YASQEYVKE VLQ I SSDGNT ITIYY 
PNGG\RGFPLA\DRPPSPT\DNISR\YSF\DNLPEKYWRKYQYA 
SRFVQLVR8KSPKITYFTRYAKCILMENSPGADFEVWFYEGVKI 
HKTEDFIQVIEKTGKSYTLKSESEVNSLKEEIKMYMDHANEGHR 
ICLALESIISEEERKTRSAPFFPIIIGRKPGSTSSPKALSPPPS 
VDSNYPTRDRASFKRMVMHSAASPTQAPILNPSMVTN3GLGLTT 
TASGTDISSNSLKDCLPKSAQLLKSVFVKNVGWATQ\LTSGAVW 
VQFNDGSQLWQAGVSSISYTSPNGQ\TTR\YGENEKLPDYIKQ 

KLQCLS S I LLMFSNPTPNBH 

RVCSSSASTASQAVMADAWEEIRRLAADFQRAQFAEATQRLSER 
NCIEIVNKLIAQKQLEWHTLDGKEYITPAQISKEMRDELHVRG 
GRVNIVDLQQVINVDLIHIENRIGDIIKSEKHVQLVLGQLIDEN 
YLDRLAEEVNDKLQESGQVTISELCKTYDLPGNFIiTQALTQRLG 
RIISGHIDLDNRGVIFTEAFVARHKARIRGLFSAITRPTAVNSL 
ISKYGFQEQLLYSVLE3LVKSGRLRGTWGGRQDKAVFVPDIYS 
RTQSTWVDSFFRQNGYLEFDALSRLGIPDAVSYIKKRYKTTQLL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

amino acid 
sequence 


red end 
nucleotide 

to first 
amino acid 
residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , tsT=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine / T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








FLKAACVGQGLVDQVEASVEEAI SSGTWVD I APLLPTS LSVEDA 
AILLQQVMRAFSKQASTWFSDTVWSEKF\INDCTELFRELMH 
QKAEKEMKNNPVHLITEEDLKQISTLESVSTSKKDKKDERRRKA 
TEGSGSMRGGGGGNAREYKIKKVKKKGRKDDDSDDESQSSHTGK 
KKPEISFMFQDEIEDFLRKHIQDAPEEFISELAEYLIKPLNKTY 
LEWRSVFMSSTTSASGTGRKRTIKDLQEEVSNLYNNIRLFEKG 
MKFFADDTQAALTKHLLKSVCTDITNLIFNFLASDLMMAVDDPA 
AITSEIRKKILSKLSEETKVALTKLHNSLNEKSIEDFISCLDSA 
AEACDIMVKRGDKKREROILFQHRQALAEQLKVTEDPALILHLT 
SVLLFQFSTHSMLHAPGRCVPQIIAFLNSKIPEDQHALLVKYQG 
LWKQLVSQSKKTGQGDYPLNNELDKEQEDVASTTRKELQELSS 
SIKDLVLKSRKSSVTEE 


5.963 


62 


1130 


PWNPQDFPGNRGLMG\QKGEIGPP\GQQGKKGAPGMP\GLMGSN 
GSPGQPGTPGSKGSKGEPGIQGMPGASGLKGEPGATGSPGEPGY 
MGLPG IQGKKGDKGNQGE KG I QGQKGENGRQGI PGQQG I QGHHG 
AKGERGEKGEPGVRGAIGSKGESGVDGLMGPAGPKGQPGDPGPQ 
GPPGLDGKPGREFSEQFIRQVCTDVIRAQLPVLLQSGRIRNCDH 
CLSUHUSrU 1 t'GP J GP 1 GPEGPRGLPGLPGRDGVPGLVGVPGRP 
GVRGLKGLPGRNGEKGSQGFGYPGEQGPPGPPGPEGPPGISKEG 
PPGDPGLPGKDGDHGKPGIQGQPGPPGICDPSLCFSVIARRDPF 
RKGPNY 


5964 


3 


2147 


3CRTRGRLSPLQPREAGSSRGSRARSEPPRFGGMEEAC0V0TTK 
RGDPHELRNIFLQYASTEVDGERYMTPEDFVQF.YLGLYNDPNSN 
PKIVQLLAGVADOTKDGLISYQEFLAFESVLCAPDSMFIVAFQL 
FDKE GNGEVTFEKVKE I FGQT I IHHHI PFNWDCEFIRLHFGHNR 
XKHLNYTEFTQFLQELQLEHARQAFALKDKSKSGMISGLDFSDI 
MVTIRSHMLTPFVEENLVSAAGGSISHQVSFSYFNAFNSLLNNM 
ELVRKIYSTLAGTRKDAEVTKEEFAQSAIRYGQATPLE1DILYQ 
LADLYNASGRLTLADIERIAPLAEGALPYNLAELQRQQSPGLGR 
PIWLQ IAESAYRFTLGSVAGAVGATAVYP IDLVKTRMQNQRGSG 
SWGELMYKNSFDCFKKVLRYEGFFGLYRGLIPQLIGVAPEKAI 
KLTVNDFVRDKFTRRDGSVPLPAEVLAGGCAGC-SQVIFTNPLEI 
VKIRLQVAGEITTGPRVSALKVLRDLGIFGLYKGAKACFLRDIP 
FSAIYFPVYAHCKLLLADENGHVGGLNLLAAGAMAG\VPAASLV 
TPADVIKTRLQVAARAGQTTYSGVIDCFRKIL\REEGPSAFWKG 
TAARVFRSS PQFG \VTLVTYELLQRGFYIDFGGLKPAGSEPTPK 
SRIADLPPANPDHIGGYRIATATFAC-IENKFGLYLPKFKSPSVA 
WQPKAAVAATQ 


5965 




1498 


MVTWLYRFLPTSNMAAKLRSLLPPDLRLQFWLHARLQKCFLSRG 
CGSYCAGAKASPLPGKMAMGLMCGRRELLRLLQSGRRVHSVAGP 
SQWLGXPLTTRLLFPAAPCCCRPHYLFLAASGPRSLSrSAISFA 
EVQVQAPPWAATPSPTAVPEVASGETADWQTAAEQSFAELGL 
GSYTPVGLIQNLLEFMHVDLGLPWWGAIAACTVFARCLIFPLIV 
TGQREAARIHNHLPEIQKFSSRIREAKLAGDHIEYYKASSEMAL 
1 » K 1 -i-lGl F.LYKPLI LPVTQAPIFI SFFIALREMANLPVPSLQTG 
GLWWFQDLTVSDPIYILPLAVTATMWAVLELGAETGVQSSDLQW 
MRNVIRMMPLITLPITMHFPTAVFMYWLSSNLFSLVQVSCLRIP 
AVRTVLKIPQRWHDLDKLPPREGFLESFKKGWKNAEMTRQLRE 
REQRMRNQLEIAARGPLRQTFTHNPLLQPGKDNPPNI PSS \SSS 
SSKPKSKYPWHDTLG 


5966 






RSKQVMARLTKRRQADTKAIQHLWAAIEIIRNQKQIANIDRITK 
YMSRVHGMHPKETTRQLSLAVKDGLIVETLTVGGKGSKAGIEQE 
GYWLPGDEIDWETENHDWYCFECHLPGEVLICDLCFRVYHSKCL 
SDEFKLRDSSSPWQCPVCRSIKKKNTNKQEMGTYLRFIVSRMKE 
RAIDLNKKGKDNKHPMYRRLVHSAVDVPT1QEKVNEGKYRSYEE 
FKADAQLLLHNTVIFYGADSEQADIARMLYKDTCHEL\DELQLC 
KNi "-'VUANARPDNWFCYPCIPNHELDWAKMKGFGFWPAKVMQKE 
DNQ VD VRFFGHHHQRAW IPSENIQDI TVNI HRLHVKRS MGWKKA 
li Li .L.-iCRFLREGRFWKSKNEDRGEEEAESSISSTSNEGIjKVT 
CEPRAKKGRRNQSVEPKKEEPEPETEAVSSSQEIPTMPQPIEKV 

svstqtkklsassprmlhrstqttndgvcqsmchdkytkifndf 
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SEQ 
ID 
NO: 


beginning 
nucleotide 

corresponding 
sequence 


nucleotide _ 
location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, ?=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, K=Asparagine, 
P=Proline / Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


59S7 






KDRMKSDHKRETERWREALBKLRSEMEEEKRQAVNKAVANMQG 
EMDRKCKQVKE KCKEEFVEE I KKLATQHKQL I SQTKKKQWCYNC 
EEEAMYHCCWNTS YCS I KCQQEHNHAEHKRTCRRKR 


102 


1925 


RSKQVMARLTKRRQADTKAIQHLWAA1K11RNQKQIANIDR1TK 
YMSKVHGMHPKETTRQLSLAVKDGLIVETLTVGCKGSKAGIEQE 
GYWLPGDEIDNETENHDWYCFECHLPGEVLICDLCFRVYHSKCL 
SDEFRLRDSSSPWQCPVCRSIKKKNTNKQEMGTYLRFIVSRMKE 
RAIDLNKKGKDNKHPMYRRLVHSAVDVPTIQEKVNEGKYRSYEE 
FKADAQLLLH1ITVIFYGADSEQADIARMLYKDTCHEL\DELQLC 
KNCFYLANARPDMWFCYPCIPNHELDWAKMKGFGFWPAKi/MQKE 
DNQVDVRFFGHHHQRAWIPSENIQDITWIHRLHVKRSMGWKKA 
CDELELHQRFLREGRFWKSKNEDRGEEEAESSISSTSNEQLKVT 
QEPRAKKGRRNQSVEPKKEEPEPETEAVSSSQEIPTMPQPIEKV 
SVSTQTKKLSASSPRMLHRSTQTTNDGVCQSMCHDKYTKIFNDF 
KDRMKSDHKRETERWREALEKLRSEMEEEKP.QAVNKAVANMQG 
EMDRKCKQVKEKCKEEFVEEIKKLATQHKQLISQTKKKQWCYNC 
BaEmYh ' I -GIKCQQEHWHAEHKRTCRRKR 


5968 


81 


12 8 3 


^RFPRRGGAPPTVLTPGKQQGVFLGPQRPGSRPDIPARGQPHPP 
RPVGVSTSAQAQVQPPAMHRRRLALGLGFO.LA0TSLSVLWVYL 
SNWLPVSYVPYYLPCPEIFNMKLHYKREKPLQPWWSQYPQPKL 
LEHRPTQLLTLTPWLAPIVSEGTFNPELLQHIYOPIjNLTIGVTV 
FAVGN/'HFLESAEEFFMRGYRVHYYIFTDNPAAVPGVPLGPHRL 
LSSIPIQGHSIIWEETSMRRMETISQHIAKRAKREVDYLFCLDVD 

hvfrnpwgpetlgdlvamhpsyyavprqqfpyerrrvstafva 

DSEGDFYYGGAVFGGQVARVYEFTKGCHMAIIADKAN3IMAAWR 
E3SHLNRHFISNKPSKVLSPEYLWDDRKPQPPSLKLIRFSTLDK 


5969 


1125 


503 


DVGFNIKRKRCDLDVFLESPRKPSGRRDRAPEKQRRIAANKCLC 
TGVREGEPPS/TTSQKVKEAGRDFTYLIWLFGISITGGLFYTI 
FKELFSSSSPSOYGRALEKCRSHPEVIGVFGESVKGYGEVTRR 
GRRQHVRFTEYVKDGLKHTCVKFYIEGSEPGKQGTVYAQVKENP 
GSGEYDFRYIFVEIESYPRRTIIIEDNRSQDD 


5970 


31S 


4712 


SQDNIGHRLLQKHGWKLGQGLGKSLQGRTDPIPIWKYDVMGMG 
RMEMELDYAEDATERRRVLEVEKEDTEELRQKYKDYVDKEKAIA 
KALSDLFANFYCELCDKQYQKHQEFDNHINSYDHAHKQRLKDIjK 
QREFARNVSSRSRKDEKKQEKALRRLHELAEQRKQAECAPGSGP 
MFKPTTVAVDEEGC-EDDKDESATNSGTGATASCGLGSEFSTDKG 
G P F TAVQ I TNTTGLAQAPGLAS QG I S FG I KKNLGTPLQ JCLGVS F 
SFAKXAPVKLESIASVFKDHAEEGTSEDGTKPDEKSSDQGLQKV 
C-DSDGSSNLDGXKEDEDPQDGGSLASTLSKLKRMKREEGAGATE 
PEYYHYIPPAHCKVKPIIFPFLLFMRASEQMDGDNTTHPKNAPES 
KKGSSPKPKSCIKAAASQGAEKTVSEVSEQPKETSMTEPSEPGS 
KAEAKI<ALGGDVSDQSLESHSQKVSETQMCESNSSKETSLATPA 
GKESQEGPKHPTGPFFPVLSKDESTALO.WPSELLIFTKAEPSIS 
YSCNPLYFDFKLSRNKDARTKGTEKPKDIGSSSKDHLQGLDPGE 
PNKSKEVGGEKIVRSSGGRMDAPASGSACSGLNKQEPGGSHGSE 
TEDTGRSLPSKKERSGKSHRHKKKKKHKKSSKHKRKHKADTEEK 
KSKKRKKRKRKKNKSSAPADSERGPKPEPPGSGSPA 
PPRRRRRACDDSQRRS LPAEEGSSGKKDEGGGGS S SQDHGGRKH 
KGELPPSSCQRRAGTKRSSRSSHRSQPSSGDEDSDDASSHRLHQ 
KSPSQYSEEEEEEDSGSEHSRSRSRSGRRHSSHRSSRRSYSSSS 
D SS C3CYSRQRSYSDDSYSDYSDRSRRHSKRSHDSDDSDYAS 
3KHS KRHKYSSSDDDYSLSCSQSRSRSRSHTRERSRSRGRSRS 
SSCSRSRSKRRSRSTTAHSWQRSRSYSRDRSRSTRSPSQRSGSR 
-„--l'.;.HESPEERHSGRRDFIRSKIYRSQSPHYFRSGRGEGPGKK 
DDGRGDDSKATGPPSQNSNIGTGRGSEGDCSPEDKNSVTAKLLI, 
EKIQSRKVERKPSVSEEVQATPNKAGPKLKDPPQGYFGPKLPPS 
UG f 'VLPLIGKLPATRKPNKKCEESGLERGEEQEQSETEEGP? 
3 = 31 -LFGHQFP\SEETTGPLLDPPPEESKSGEVTADHPVAPLG 
PPAHFDCYLGDPTISHNYLPDPSDGNTLESLDSSSQPGPVESSL 
L?IAPDLEHFPSYAPPSGDPSISSTDGAEDA\SLAPLESQPITF 
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SEQ 
ID 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine ; I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TPEEKEKYSKLQQAAQQHIQQQLIAKQVKAFPASAALAPATPAL " 
QPIHIQQPATASATSITTVQHAILQHHAAAAAAAIGIHPHPHPQ 
PLAQVHH I PQPHLTP I SLSHLTHS 1 1 PGHPATFLASHP IHIIPA 
SAIHPGPFTFHPVPHAALYPTLLAPRPAAAAATALHLHPLLHPI 
FSGQDLQHPPSHGT 


5971 


53 


2149 


SFLYFVGVDMDNPIGNWDGRFDGVQLCSFACVESTILLHINDII 
PESVTQERRPPKLAFMSRGVGDK3SSSHNKPKATGSTSDPGNRN 
RS3LFYTLNGSSVDSQPQSKSKNTWYIDEVAEDPAKSLTEISTD 
FDRSSPPLQPPPVNSLTTENRFHSLPFSLTKMPNTOGSIGHSPL 
SLSAQSVMEELNTAPVQESPPLAMPPGNSHGLEVGSLAEVKENP 
PFYGVIRWIGQPPGLNEVLAGLELEDECAG\CTDGTF/REGTRY 
FTCALKKALFVKLKSCRPDSRFASLQPVSNQIERCNSLAIWEAY 
LS E WEENTPTQKWEKEGLE IMIG \KKKGI QGHYNS CYLDSTLF 
CL FAFS S VLDTVLLRP KEKNDVE Y YS E TQE LLRTE I VNP LR I YG 
YVCATKIMKLRKILEKVEAASGFTSEEKDPEEFLNILFHHILRV 
EPLLKIRSAGQKVQDCYFYQIFMEKWEKVGVPTIQQLLEWSFIN 
SNLKFAEAPSCLIIQMPRFGKDFKLFKKIFPSLELNITDLLEDT 
PRQCRICGGLAM YECRE C YDDPD I SAGKI KQFCKTCNTQVHLHP 
KRLNHKYNPVSLPKDLPDWDNRIIGCIPCQNMELFAVLCIETSHY 
VAFVKYGKDDSAWLFFDSMADRDGGQNGFNIPQVTPCPEVGEYL 
KMSLEDLHSLDSRRIQGCARRLLCDAIYVPCTQSPTMSLYK. 


5972 


440 


1761 


ILiAGSPSPRDQCSQRQSSGGDKELVTRGCTFSTAWSPSAMTQ 
EPFREELAYDRMPTLERGRQDPASYAPDAKPSDLOLSKRLPPCF 
SHKTWVFSVLMGSCLLVTSGFSLYLGNVFPAEMDYLRCAAGSCI 
PSAIVSFTVSRRNANVIPNFQILFVSTFAVTTTCLIWFGCKLVL 
NPSAININFNLILLLLLELLMAATVIIAARSSEEDCKKKKGSMS 
DSANILDEVPFPARVLKSYSWEVIAGISAVLGGI1ALNVDDSV 
SGPHLSVTFFWILVACFPSAIASHVAAECPNKCLVEVLIAISSL 
TSPLLFTASGYLSFSIMRIVEMFKDYPPAIKPSYDVLLLLLIiLV 
LLLQA/GPQHGHRHPVRALQGQCKAAGCILGHPERPAGAPGWGG 
GQEPPEGVRQGESLESRRGANGPVTPRRGNRVAAPSLAPGMETH 
\"P 


5973 


65 


■ 2007 


NGDGKDLFGHIWAWRSNGIISNFRRSPHAGMAEDEPDAKSPKTG 
GRAPPGGAEAGEPTTLLQRLRGTISKAVQNKVEGILQDVQKFSD 
NDKLYLYLQLPSGPTTGDKSSEPSTLSNEEYMYAYRWIRNHLEE 
HTDTCLPKQSVYDAYRKYCESLACCRPLSTANFGKIIREIFPDI 
XARRLGGRGQSKYCYSGIRRKTLVSMPPLPGLDLKGSESPEMGP 
EVTPAPRDELVEAACALTCDWAERILKRSFSSIVEVARFLLQQH 
LISARSAHAHVLKAMGLAEEDEHAPRERSSKPKNGLENPEGGAH 
KKPERLAQPPKDLEARTGAGPLARGERKKSWESSAPGANNLQV 
NALVARLPLLLPRAPRSLIPPIPVSPPILAPRLSSGALKVATLP 
LS5RAGAPPAAVPIINMILPTVPALPGPGPGPGRAPPGGLTQPR 
GTENREVG1GGDQGPHDKGVKRTAEVPVSEASGQAPPAKAAKQD 
ZEDTASDAKRKRGRPLKKSGGSGERNSTPLKSAAAMESAQSSRL 
PW3TWGSGGEGNSAGGAERPGPMGEAEKGAVLAQG\QGDGTVSK 
GGRGPGSQHTKEAEDKIPLVPSKVSVIKGSRSQKEAFPLAKGEV 
DTAPQGNKDLKEHVLQSSLSQEHKDPKATPP 


5974 


4293 


2200 


LC-LQMHTTSGRIHQAMVTSLWEDNESVTVEWIENGDTKGK\EID ' 

LESIFSLNP\DL\VPDGEIEPSP\ETPPPPASSAKVNKIVKNRR 

TV\ASIKNDPPS\RDNRWGSARARPSQFPEQFSSAQQNGSV\S 

DISPVQAAKKEFGPPSRRKSNCVKEVEKLQEKREKRRLQQQELR 

EKRAQDVDATNPNYEIMCMIRDFRGSLDYRPLTTADPIDEHRIC 

VCTOKRPLKKKETQMKDLDVITIPSKDWMVHEPKQKVDLTRYL 

ENQTFRFDYAFDDSAPNEMVYRFTARPLVET I FERGMATCFAYG 

QTGSGKTHTMGGDFSG3CMQDCSKGIYALAARDVFLMLKKPNYKK 

LELQVYATFFEIYSGKVFDLLNRKTKLRVLEDGKQQVQWGLQE 

REVKCVEDVLKLID IGNS CRTSGQTSANAHS SRSHAVFQ I ILRR 

KGKLHGKFSLIDLiAGNERGADTSSADRQTRLEGAEINKSLLALK 

E CI RALGRNKPHTPFRAS KLTQVLRDS F I GEWSRTCM I AT I S PG 

MAS C ENTLNTLR YANRVKELTVD P TAAGDVRP I MHHP PNQ I \ DD 

LETQWGVGSSPQRDDLKLLCEQNEEEVSPQLFTFHEAVSQMVEM 
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SEQ 
ID 
KTO: 


Predicted 
beginning 

corresponding 

residue cf 
amino acid 


Predicted end 

corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threor.ine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=?ossible nucleotide insertion) 








EJ'/v-jEDHRAVFQESIRWLEDEKALLEMTEBVDYDVDSYATQLE 
AILSCKIDILTELRDKVKSFRAALQEEEQASKQINPKRPRAL 


B97S 


4293 


2200 


LGLQKHTTSGRIHOAMVTSLNEDNE£VTVEWIENGDTKGK\EID 
LE3IFSLNP\DL\VPDGEIEPSP\ETPPPPASSAKVNKIVKNRR 
TV\ASIKUDPPS\RDNRWGSARARPSQFPEQFSSAQQNGSV\S 
DISPVQAAKKEFGPPSRRKSNCVKEVEKLQEKREKRRLQQQELR 
EKRAQDVDATNPNYEIMCMIRDFRGSLDYRPLTTADPIDEHRIC 
VCVRKRPLNKKETQMKDLDVITIPSKDWMVHEPKQKVDLTRYL 
E^QTFRFDYAFDDSAPNEMVYRFTARPLVETIFERGMATCFAYG 
QTGSGKTHTMGGDFSGKNQDCSKGIYALAARDVFLMLKKPNYKK 
LELQVYATFFEIYSGKVFDLLNRKTKLRVLEDGKQQVQWGLQE 
REVKCVEDVLKL I D IGNSCRTSGQTSANAHS SRSHAVFQ 1 1 LRR 
KGKLHGKFS L I DLAGHERGADTS SADRQTRLEGAE INKSLLALK 
ECIRALGRNKPHTPFRASKLTQVLRDSFIGENSRTCMIATISPG 
MASCENTLNTLRYANRVKELTVDPTAAGDVRP1MHHPPNQI\DD 
LETQW3VGS S PQRDDLKLLCEQNEEEVS PQLFTFHEAVS QMVEM 
EEQWEDHRAVFQESIRWLEDEKALLEMTEEVDYDVDSYATCLE 
AILEQKIDILTELRDKVKSFRAALQEEEQASKQINPKRPliAL 


5976 




2 34 9 


"1 I^nyiGIKTLNLVLG\LKRMLEF " 
PEVSWMEVKDPNMKGAMLTNTGKYAIPTIDA\EAYAIGKKEKPP 
FLPEEPSSSSEEDDPIPDELLCLICKDIMTDAWIPCCGNSYCD 
EC-RTALLESDEHTCPTCHQNDVSPDALIANKFLRQAVNNFKNE 
TGYTKRLRKQLPSPPPPI PPPRPLIQRNLQPLMRSP I SRQQDPL 
MI PVTSS S THPAPS I S S LTSNQSSLAPPVSGHPS SAPAP VPDIT 
ATVSISVKSEKSDGPFRDSDNKILPAAALASEHSKGTSSIAITA 
LMEEKGYGVPVLGTPSLLGQSLLHGQLIPTTGPVRINTARPGGG 
RPGWEHSNKLGYLVSPPQQIRRGERSCYRSINRGRHHSERSQRT 
QGPSLPATPVFVPVPPPP^YPPPPHTLPLPPGVPPPQFSPQFPP 
GQ P \ P PAG YS VP PPG F P PAPANLS TPWVS SG VQTAHSNT 2 PTTQ 
APPLSREEFYREQRRLKEEEKKKSKLDEFTNDFAKELMEYKKIQ 
KERRRSFSRSKSPYSGSSYSRSSYTYSKSRSGSTRSRSYSRSFS 
RSHSRSYSRSPPYPRRGRGKSRNYRSRSRSHGYHRSRSRSPPYR 
RYHSRSRSPQAFRGQSPNKRNVPQGETEREYFNRYREVPPPYDM 
KAYYGRSVDFRDPFEKERYREWERKYREWYEKYYKGYAAGAQPR 
PSANRENFSPERFLPLNIRNSPFTRGRREDYVGGQSHRSRNIGS 
NYPSKLSARDGHNQKDNTKSKEKESEItfAPGDGKGNKHKKHRKRR 
KG3ESEGFLNPELLETSRKSREPTGVEENKTDSLFVLPSRDDAT 
PVRDEPMDAESITFKSVSEKDKRERDKPIOiKGDKTKRKNDGSAV 
3KI r rVKPAKGPQEKVDG\DVRDLLDLNL\QLKKPKEETPKDL 
TILNHHLPLRRMKKSL\EPP\EKLTLIJQQK\TPRNKTSQRGKSE 
EGLFQRCQIRKANN 


5977 


1363 


1336 


FLEDRGQVLSHFQCLSLHSINHILHPGAGVAAGPATGW/REYLT 
PVLKESKFKETGVITPEEFVAAGDHLVHHCPTWQWATGEELKVK 
AYLPTGKQFLVTKNVPCYKRCKQMEYSDELEAIIEEDDGDGGWV 
DTYHNTG I TGI TEAVKE I TLENKENIRLQDCSALCEEEEDEDEG 
3AADMEEYEESGLLETDEATLDTRKIVEACKAKTDAGGEEAILQ 
TRTYDLYITYDKYYQTPRLWLFGYDEQRQPLTVEHMYEDISQDH 
VKKTVT I ENHPHLPPPPMCS VHPCRHAEVMKKI I ETVAEGGGEL 




160 


3213 


RDGARRWGGCCSPLTWAPGFYRRFDLATSGRRLRGQTAEPAGRQ " 

RPRREPEAMDEQSVESIAEVFRCFICMEKLRDARLCPHCSKLCC 

FSCIRRWLTEQRAQCPHCRAPLQLRELVNCRWAEEVTQQLDTLQ 

LCSLTKHEENEKDKCENHHEKLSVFCWTCKKCICHQCALWGGMK 

GGHTFKPLAEIYEQHVTKVNEEVAKLRRRLMELISLVQEVERNV 

EAVRNAKDERVREIRNAVEMMIARLDTQLKNKr.ITLMGQKTSLT 

QETELLESLLQEVEHQLRSCSKSELISKSSEILMMFQQVHRKPM 

ASFVTTPVPPDFTSELVPSYDSATFVLENFSTLRQRADPVYSPP 

LQVSGLCWRLKVYPDGNGWRGYYLSVFLELSAGLPETSKYEYR 

VEMVHQSCNDPTKNIIREFASDFEVGECWGYNRFFRLDLLANEG 

YLNPQNDTVILRFQVRSPTFFQKSRDQHWYITQLEAAQTSYIQQ 

INNLKERLTIELSRTQKSRDLSPPDNHLSPQNDDALETRAKKSA 
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SEQ 
NO: 


Predicted 

nucleotide 

location 

corresponding 

amino acid 


Predicted end 
nucleotide 

cor re spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E = 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamir.e , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








CSDMLLER\GPYSAS\VREAKEDEEDEEKIQNEDYHEELSDGDL 
DLDLVYEDE VNQLDGS S S 3AS STATSNTEEHDI DEETMSGENDV 
EYNNMELEEGELMEDAAAAGPAGSSHGYVGSSSRISRRTHLCSA 
ATSSLLD IDPLI LIHLLDLKDRSS I ENLWGLQPRPPASLLQPTA 
SYSRIQKDQRKQQAMWRVPSDLKMLKRLKTQMAEVRCMKTDVKN 
TLSEIKSSSAASGDMQTSLFSADQAALAACGTEN3GRLQDLGME 
LLAKS3VANCYIRNSTNKKSNSPKPARSSVAGSLSLRRAVDPGE 
NSRSKGDCQTLSEGSPGSSQSGSRHSSPRALIHGSIGDILPKTE 
DRQCKALDSDAVWAVPSGLPAVEKRRKMVTLGANAKGGHLEGL 
QMTDLENNSETGELQPVLPEGASAAPEEGMSSDSDIECDTENEE 
QEEHTSVGGFHDSFMVMTQPPDEDTHSSFPDGEQIGPEDLSFNT 
DENSGR 


5979 


212 


3 665 " 


LPDMTMYLWLKLLAFGFAFLDTEVFVTGQSPTPSPTDAYLNASE 
TTTLSPSGSAVISTTTIATTPSKPTCDEKYANITVDYLYNKETK 
LFTAKLNVNEHVECGNNTCTNNEVHNLTECKNASVSISHNSCTA 
PDKTLILDVPPGVEKVPVHCCS\QVEQPDST1WLKWKNIETSTC 
DTQNITYRFQCGNMIFDNKEIKLENLEPEHEYKCDSEILYNSHK 
FTNASKIIKTDFGSPGEPQIIFCRSEAAHQGVITWNPPQRSFHN 
FTLCYIKETEKDCLNLDKNLIKYDLQNLKPYTKYVLSLHAYIIA 
KVQRNGSAAMCHFTTKSAPPSQVWNMTVSMTSDNSMHVKCRPPR 
DRNGPHERYHLEVEAGNTLVRNESHKMCDFRVKDLQYSTDYTFK 
AYFHNGDYPGEP F I LHHSTS YNS KAL IAFLAFL 1 1 VTS I ALL W 
LYKIYDLHKKRSCNLDEQQELVERDDEKQLMNVEPIHADILLET 
YKRKIADEGRLFLAEFQSZPRVFSKFPIKEARKPFWQNKNRYVD 
ILPYDYNRVELSEINGDAGSNYIKASYID3FKEPRKYIAAQGPR 
DETVDDFWRMIWEQKATVIVMVTRCEEGNRNKCAEYWPSMEEGT 
RAFGECCCKDLTKHKRCP\DYIICKLWIVNKKEKATGREVTHIQ 
FTSWPDHGVPEDPHLLLKLRRRVWAFSNFFSGPIWHGSAGVGR 
TGTYIGIDAMLEGLEAENKVDVYGYWKLRRCRCLMVQVEAQYI 
DIHQALVEYNQFGETEVNLSELHPYLHKMKKRDPPSEPSPLEAE 
FQRLPSYRSWRTQHIGNQE\ENKSKNRNSMVIPYDYNRVPLKHE 
LEMSKESEHDSDESSDDDSDSEEPSKYINASFIMSYWKP\EVMI 
AAQGPLKETIGDFWQMIFQRKVKVIVMLTELKHGDQEICAQYWG 
EGKQTYGDIEVDLKDTDKSSTYTLRVFELRHSKRKDSRTVYQYQ 
YTNWSVEQLPAEPKELISMIQWKQKLPQXNSSEGNKHHKSTPL 
LIHCRDGSQQTGIFCALLNLLESAETEEWDIFQWKALRKARP 
GE1VSTFEQYQFLYDVIASTYPAQNGQVKKNNHQEDKIEFDNEVD 
KVKQDANCVNPLGAPEKLPEAKEQAEGSEPTSGTEGPEHSVUGP 
ASPALNQGS 


5980 


3 


2353 


DAWGCKLRRLRFTYGTQTRVSLALPGQYELVHTLVAHQGNWETI " 

PEEDLEVQ3NNEDAAHDLTELEVTMHEALLQEVDWVAPCQGLR 

PTVDVLGDLVNDFLPVITYALHKDELSERDEQELQEIRKYFSFP 

VFFFKVPKLGSE 1 1 DS STRRMESERS PLYRQLIDLGYLSSSHWN 

CGAPGQDTI^AQSMLVEQSEKLRHLSTFSHQVLQTRLVDAAKALN 

LVHCHCLDIFINQAFDMQRDLQITPKRLEYTRKKENELYESLMN 

IANRKQEEMKDMIVETLNTMKEELLDDATNMEFKDVIVPENGEP 

VGTREIKCCIRQIQELIISRLNQAVANKLISSVDYLRESFVGTL 

ERCLQSLEKSQDVSVHITSNYLKQILNAAYHVEVTFHSGSSVTR 

SICSQFRTRLNSSHEAFAASLRQLEAGHSGRLEKTEDLWLRVRK 
DHAPRLARLS LES RS LQD VLLHRKP KLGQELGRGQ YG WYLCDN 
WGGHFPCALKSWPPDEKHWNDLALEFHYMRSLPKHERLVDLEG 
SVIDYNYGGGSSIAVLLIMERLHRDLYTGLKAGLTLETRLQIAL 
DWEGIRFLHSQGLVHRDIKLKNVLLDKQNRAKITDLGFCKPEA 
MMSGSIVGTPIHMAPELFTGKYDNSVDVYAFGILFWYICSGSVK 
LPEAFERCASKDHLWNNVRRGARPERLPVFDEECWQLMEACWEG 
DPLKRPLLGIV2PMLQGIMNRLCKS\NSEQPNRGLDDST 


5981 




2519 


GR K -\SAAMERPWGAADGLSRWPHGLGLLLLLQLLPPSTLSQDRL 
DAPPPPAAPLPRW£GPIGVSWGLRAAAA\GGAFPRGGRWRRSAP 
G \ EDEECGRVRDFVAKLANNTHQHVFDDLRGSVSLSWVGDS TGV 
ILV^TTFHVPLVIMTFGQSKLYRSEDYGKNFKDITDLINNTFIR 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

amino acid 


Predicted 2r.c 
nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=As?artic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TEFGMAIGPENSGKVVLTAEVSG3SRGGRIFRSSDFAKNFVQTD 
LP FHPLTQMMYS PQN3 DYLLALSTEKGLWVSKNFGGKWEE IHKA 
VCMXKGSDNTIFFrTYANGSCKADLGALELWRTSDLGKSFKTI 
GVKIYSFGLGGRFLFASVMADKDTTRRIHVSTDQGDTWSMAQLP 
SVGQEQFYSILAANDDMVFMHVDEPGDTGFGTIFTSDDRGIVYS 
KS LDRH LYTTTGGE TD FTNVTS LRGVY I TS VLS EDNS I QTM I TF 
DQGGRWTHLRKPENSECDATAKNKNECSLHIHASYSISQKLNVP 
MAPLSEPNAVGIVIAHGSVGDAISVMVPDVYISEDGGYSWTKML 
EGPHYYTZLDSGGIIVAIEHSSRPINVIKFSTDEGQCWQTYTFT 
RDPIYFTGLASEPGARSMNISIWGFTESFLT3QWVSYTIDFKDI 
LERNCEEKDYTIWLAHSTDPEDYEDGCILGYKEQFLRLRKSSVC 
QNGRDYWTKQPSICLCSLEDFLCDFGYYRP3NDSKCVEQPELK 
GHDLEFCLYGREEHLTTNGYRKIPGDKCQGGVNPVREVKDLKKK 
CTSNFLSPEKQNSKSNSVPIILAIVGLMLVTWAGVLIVKKYVC 
GGRFLVHLYSVLQQH\AEA\NGVDGVDALDTASHTNKSGYHDDS 
DEDLLE 


5982 


56 


2316 


ATR ? PROS S WCRQFSRTASAAPGRSNMLR I PVRKALVGLS KS PK 
GCWTTATAASNLIEVFVDGQSVMVEPGTTVLQACEKVGMQIPR 
FCYHERLSVAGNCRMCLVEIEKAPKWAArAMPVMKGWNILTNS 
EKSKKAREGVMEFLLANHPLDCPICDQGGECDLQDQSMMFGNDR 
SKKLEGKRAVEDKNIGPLVKTIMTRCIQCTRCIRFASEIAGVDD 
LGTTGRGNDMQVGTY1EKMFMSELSGNIIDICPVGALTSKPYAF 
TARPWETRKTESIDVMDAVGSNIWSTRTGEVMRILPRMHEDIN 
EEWISDITP- I 3 i itPMVR^EKGLLTYTSWEDALSRV 
AGMLQ 3 FQG KD VAAI AGGLVDAEALVALKDLLNRVDS D'l'LCTEE 
VFPTAGAGTDLRSNYIjLNrriAGTOEADWLLTCTNPFKRAPLF 
NARIRKSWLHNDLKVALIGSPVDLTYTYEHLGDSPKILQDIASG 
SHPFSQVLKEAKKPMWLGSSALQRNDGAAILAAVSSIAQKIRM 
TSGVTGDWKVMNILHRIASQVAALDLGYKPGVEAIRKNPPKVLF 
LLGADGGCITRQDLPKDCFIIYQGHHGDVGAPIADVILPGAAYT 
EKSATYVNTEGRAQQTKVAVTPPGLAREDWKIIRALSEIAGMTL 
P YDTL \DQVRNRLEEVS PKLVR YDD I EG \ ANY FQQANELS KLVW 
3QLLADPLVPPQLTMKDFYMTDS I SRASQTMAKCVKAVTEGAQA 
VEEPSIC 


59B3 


248 


1763 


EAxGDGGRRRHRASGRRAGRGEP \AGLKSQGQRAVPKRAVARGG 
RQ \ YSAAI ALLEPAGSE IADDLS I LYSWRAACYLKEGNCSGC I Q 
DCKRALELHPFSMKPLLRRAM.AYETLEQYGKAYVDYKTVLQIDC 
GLQLANDSVNRLSRILMELDGPNWREKLSLIPAVPASVPLQAWH 
PAXEM I SKQAGDSS SHRQQGI TDE KTFKALKEEGWQCVNDKNYK 
DALSKYSECLKINNKECAIYTNRALCYLKLCCFEEAKQDCDQAL 
QLADGIIVKAFYRRALAHKGLKNYQKSLIDLNKVILLDPS I IEAK 
KELEEVTRLLNLKDKTAPFNKEKERRKIEIQEVNEGKEEPGRPA 
GEVSTGCLASEKGGKSSRSPEDPEKLPIAKPNNAYEFGQIINAL 
STRKDKEACAHLLAITAPKDLPMFLSNKLEGDTFLLLIQSLKNN 
LIEKDPSLVYQHLLYLSKAERFKMMLTLISKGQKELIEQLFEDL 
SDTPNNHFTLEDIQALKRQYEL 


5984 


755 


1193 


SSVCMACTi VSKLGKKQRSVS FLASGLMRVSTGPELRLHHSFVL 
TGDVGRRICRLLVGLFTKGDTSSKRVHPFSPGPCFLLCDLARVG 

FHHSLP.SISRFSSG 


5985 


22 


1408 


RRVAR PGTAE P AKARRTVRRGRARRDLAGAERKAGVS ERGDS GR 
RRPNPSIPSAAAGMSH1Q1PPGLTELLQGYTVEVLRQQPPDLVE 
FAVEYFTRLREARAPASVLPAATPRQSLGHPPPEPGPDRVADAK 
GDSESEEDEDLEVPVPSRFNRRVSVCAETYNPDEEEEDTDPRVI 
ffl . 1 L HQRCRLQEACKDILLFKtfLDQEQLSQVLDAMFERIVKAD 
EHVIDQGDDGDNFYVI ERGTYDI LVTKDNQTRS VGQYDNRGS FG 
ELALMYNTPRAATIVATSEGSLWGLDRVTFRRIIVKNIJAKKRKM 
FESFIESVPLLKSLEVSERMKIVDVIGEKZYKR/DGERIITQGE 
K\ADSFYI IESGEVS I LIRSRTKSNKDGGNQEVE 1ARCHKGQYF 
3ELALVTNKPRAASAYAVGDVKCLVMDVQAFERLLGPCMDIMKR 
NISHYEEQLVKMFGSSVDLGWLGQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 

W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


5986 


180G 


484 


3AWKSTSLTFHWKLWGRHRGRRRGLAHPKNHLSPQQGGATPQVP 
SPCCRFDSPRGPPPPRLGLLGALMAEDGVRGSPPVPSGPPMEED 
GLRWTPKSPLDPDSGLLSCTLPNGFGGQSGPEGERSLAPPDASI 
LISNVCSIGDHVAQEI,FQGSDLGI4AEEAERPGEK\AGQHSPLRE 
EHVTCVQSILDEFLQT\YGSLIPLSTDEWEKLEDIFQQEFSTP 
SRKGLVLQLIQSYQRMPGNAMVRGFRVAYKRHVLTMDDLGTLYG 
QNWL?TDQVMNMYGDLVMDTVPEK\VHFFNSFFY\DKLRTKGYDG 
VKRWTKNVD I FNKELLL IPIHLEVJWSLI S VDVRRRTI TYFDSQ 
RTLNRRCPKHIAKYLQAEAVKKDRLDFHQGWKGYFKMNVARQNN 
DSDCGAFVLQYCKHLALSQPFSFTQQDMPKLRRQIYKELCHCKL 
TV 




18CG 


... 


DAWKSTSLTFHWKLWGRHRGRRRGIAHPKNHLSPQQGGATPQVP 
SPCCRFDSPRGPPPPRLGLLGALMAEDGVRGSPPVPSGPPMEED 
C-LRWTPKSPLDPDSGLLSCTLPNGFGGQSGPEGERSLAPPDASI 
L I SNVCS IGDHVAQELFQGSDLGMAEEAERPGEK\AGQHS PLRE 
EHVTCVQSILDEFLQT\YGSLIPLSTDEWEKLEDIFQQEFSTP 
SRKGLVLQLIQSYQRMPGNAMVRGFRVAYKRHVLTMDDLGTLYG 
QNWLNDQVMNMYGDLVMDTVPEK\VHFFNSFFY\DKLRTKGYDG 
VKRWTKNVDIFNKELLLIPIHLEVHWSLISVDVRRRTITYFDSQ 
RTLNRRCPKHIAKYLQAEAVKKDRLDFHQGWKGYFKMNVARQNN 
DSDCGAFVLQYCKHLAT.SQPFSFTQQDMPKLRRQIYKELCHCKL 
TV 


5988 


1292 


410 


FKKY FLS FLGLLES SHSRDR I HNLVLMFLLATHNLiVWWFTCR FQ 
RLDCIYLNAGIMPNPQLNIKALLFGLFS\AEGLLTQGDKITADG 
LQEVFETDVFGIIFILIRELEPLLraSDNPSQLIWTSSRNARKSN 
FSLEDFQHSKGKEPYSSSKYATDLLSVALNRNFNQQGLYSNVAC 
PGTALTNLTYGILPPFIWTLLMPAILLLRFFANAFTLTPYNGTE 
ALVWLFHQKPESLNPLIKYLSATTGFGRNYIMTQKMDLDEDTAE 
KFYQKLLELEKHIRVTIQKTDNQARLSGSCL 


59 8 9 




2610 


M4DFPQHSQHVLEQLNQQRQLGLLCDCTFWDGVHFKAHKAVLA 
ACSEYFKMbFVDQKDWHLDISNAAGLGQVLEFMYTAKLSLSPE 
NVDDVIAAVATFLQMQDI I TACHALKSLAEPATSPGGNAEALAT 
EGGDKRAKEEKVATSTLSRLEQAGRSTPIGPSRDLKEERGGQAQ 
SAASGAEQTEKADAPREPPPVELKPDPTSGMAAAEAEAALSESS 
EQEMEVEPARKGEEEQKEQEEQEEEGAGPAEVKEEGSQLENGEA 
PEENENEESAGTDSGQELGSEARGLRSGTYGDRTESKAYGSVIH 
KCSDCGKEFTHTGNFKRHIRIHTGEKPFSCRECSKAFSDPAACK 
AHEKTHSPLKPYGCEECGKSYRLISLLNLRKKRHSGEARYRCED 
CGKLFTTSGNLKRHQLVHSGEKPYaCDYCGRSFSDPTSKMRHLE 
-HD.DKEHKCPHCDKKFNQVGNLKAHLKIHIADGPLKCRECGKQ 
FTTSGNLKRHLRIHSGEKPYVCIHCQRQFADPGALQRHVRIHTG 
EK ""'CVMCGKAFTQASSLIAHVRQHTGEKPYVCERCGKRFVQS 
SQLANHIRHHDNIRPHKCSVCSKAFVNVGDLSKHIIIHTGEKPY 
LCDKCGRGFNRVDNLRSHVKTVHQGKAGIKILEPEEGSEVSWT 
VDDMVTLATEALAATAVTQLTWPVGAAVTADETEVLKAEISKA 
VKQVQEEDPNTHILYACDSCGDKFLDANSLAQHVRIHTAQALVM 
FQTDADFYQQYGPGGTWPAGQVLQAGELVFRPRDGAEGQPALAE 
TSPTAPECPPPAE 


5990 


2 


4700 


FGPGPDSGGGARGSGWGSRSQAPYGTLGAVSGGEQVLLHEEAGD 
£ :t . JLSRLGPSLRDKDLEMEELMLQDETLLGTMQSYMDASLIS 
LIEDJGSLGEVEMSLPDPSWDFSPPSFLETSSPKLPSWRPPRSR 
PRWGQS PPPQQRSDGEEEEEVAS FSGQILAGELDNCVSS I PDFP 
MHLACPESEDKATAAEMAVPAAGDESISSLSELVRAMHPYCLPN 
LTHLASLEDELQEQPDDLTLPEGCWLEIVGQAATAGDDLEIPV 
.' VRQVS PGPRPVLLDDSLETSSALQLLMPTLESETEAAVPKVTL 
C,'L':.S3LSLNSEEKLDSACLLKPREWEPWPKEPQNPPANAAP 
GSQRARKGRKKKSKEQPAACVEGYARRLRSSSRGQSTVGTEVTS 
"'HI 'J"CPQEELQKESGPLQGKGKPRAWARAWAAALENSSPKN 
LERSi 3CSSPAKEGPLDLYPKLADTIQTNP1PTHLSLVDSAQAS 
PMPVDSVEADPTAVGPVLAGPVPVDPGLVE LASTS SELVEPLPA 
EPVLINPVLADSAAVDPAWPISDNLPPVEIAVPSGPAPVDIALV 
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jrresponding 
> first 



corre spondir 
to first ^ 

residue of 

sequence 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X-Unknown, *=Stop 
Codon, /'=po33ible nucleotide deletion, 
\=possible nucleotide insertion) 
DPVPNDLTPVDPVLVKSRPTDPRRGAVSSALGGSAPQLLVESES 
LDPPKTIIPEVKEWDSLKIESGTSATTHEARPRPLSLSEYRRR 
RQQRQAETEERSPQPPTGKWPSLPETPTGLADIPCLVIPPAPAK 
KTALQRSPETPLEICLVPVGPSPASPSPEPPVSKPVASSPTEQV 
PSQEMPLLARP3PPVQSVSPAVPTPPSMSAALPFPAGGLGMPPS 
LPPPPLQPPSLPLSMGPVLPDPFTHYAPLPSWPCYPEVSPSGYP 
CLPPPPTVPLVSGTPGAYAVPPTCSVPWAPPPAPVSPYSSTCTY 
GPLGWGPGPQHAPFWSTVPPPPLPPASIGRAVPQPKMESRGTPA 
GPPENVLPLStlAPPLSLGLPGHGAPQTEPTKVEVKPVPASPHPK 
HKVSALVQSPQMKALACVSAEGVTVEBPASERLKPETQETRPRE 
KPPLPATKAVPTPRQSTVPKLPAVHPARLRKLSFLPTPRTQGSE 
DWQAFISEIGIEASDLSSLLEQFEKSEAKKECPPPAPADSLAV 
GNSGGVDIPQEXRPLDRLQAPELAWAGLTPPATPPHQLWKPLA 
AVSLLAKAKSPXSTAQEGTLKPEGVTEAKHPAAVRLQEGVHGPS 
RVHVGSGDHDYC\VRSRTPPKK\MPAIjLIPEVGSRWKVKRHQDI 
TIKPVLSLGPAAPPPPCIAASREPLDHRTSSEQADPSAPCLAPS 
SLLSPEASPCRNDMNTRTPPEPSAKQRSMRCYRKACRSASPSSQ 



PSPRRRSDRRRRYSSYRSHDHYQKQRVLQKERAIEERRWFIGK 
I PCRXTRS EL KQRFSVFGE I EECT IIIFRVQGDNYGFVTYRYAEE 
AFAAIESGHKLRQADEQPFDLCFGGRRQFCKESYSDLDSNREDF 
nPAPVKSKFDSI.DFDTr.LKQAQKNLRR 



rlsshfsqcspsiycXtkfdkqgnvtsferkktelyqelgijQar 
dlrfqhvmsitvrnnriimrmeylkavitpecllildyrnlnlk 

QWLFRELPSQLSGEGQLVTYPLPFEFRAIEALLQYWINTLQGKL 
SILQPLILETLDALGDPKHSSVDRSKLHILLQNGKSLSELETDI 
KIFKESILEILDEEELLEELCVSKWSDPQVFEKSSAGIDHAEEM 
ELLLENYYRLADDLSNAARELRVL I DDSQS 1 1 F INLDSHRWVMM 
RLNLQLTMGTFSL3LFGLMGVAFGMNLESSLEEDHRIFWLITGI 
MFMGSGLIWRRLLS FLGR/LARS S I AS YGMKDMVHGG I VEGL 



AGPDFRLVCGVSGSGFPGGRQGQATEWRPLRPWNGAMEKLRRVL 
SGQDDEEQGLTAQDSQINL/SEVLDASSLSFNTRLKWFAICFVC 
GVFFS I LGTGLLWLPGG I KLFAVFYTLGNLAALASTCFLMGP VK 
QLKKMFEATRLLATIVMLLCFIFTLCAALWWHKKGLAVLFCILQ 
FLSMTWYSLSYIPYARDAVIKCCSSLLS 



AEGLGSWAVWAGLGWAGRHMEAGGATGALGVGCKLPSAFCFPGS 
SVAMDMFQKVEKIGEGTYGWYKAKNRETGQLVALKKIRLDLEM 
EGVPSTAIREISLLKELKHPWIVRLLDWHNERKLYLVFEFLSQ 
DLKKYMDSTPGSE1PLHLIKSYLFQLLQGVSFCHSHRVIHRDLK 
PQNLL INELGA I KLAD FGLARAFGVPLRTYTHE WTLW YRAP E I 
LLATRFYTTAVDINSIGCIFAEMVTRKALFPGDS\EIDQ\LFRI 
FRMLGTPSEDTWPGVTQLPDYKGSFPKWTRKGLEEIVPNLEPEG 
RDLLMQLLQY'DPSQRITAKTAljAHPYFSSPEPSPAARQYVljQRF 
RH 



AGEVQLHVWIRGMRIQPQ/KAAAIIDLDPDFEPQSRPRSCTWPL 
PRPEIANQPSKPPEVEPDLGEKVHTEGRSEPILLPSRLPEPAGG 
PQPGILGAVTGPRKGGSRRNAWGNQSYAELISQAIESAPEKRLT 
LAQIYEWMVRTVPYFKDKGDSNSSAGWKNSIRHNLSLHSKFIKV 



PKKKPSGLPAPPEGATPTSPVGHFAKWSGSPCSRNREEADMWTT 
FRPRSSSNASSVSTRLSPLRPESEVLAEEIPASVSSYAGGVPPT 
LNEGLELLDGLNLTSSHSLLSRSGLSGFSLQHPGVTGPLHTYSS 
SLFSPAEGPLSAGEGCFSSSQALEALLTSDTPPPPADVLMTQVD 
PILSQAPTLLLLGGLPSSSKLATGVGLCPKPLEAPGPSSLVPTL 
SMIAPPPVMASAPIPKALGTPVLTPPTEAASQDRMPQDLDLDMY 

MENLECJMDyilSDLMDEGEGLDFNFEPDP 

RPPGPGPASGAWLCTRARGSAAFVPPLPRPPSRGARRRRRLPGR 
GVAALRRGPGSAPGLPRGRAERSAAGSGRGPSREERGAAAAAAA 
AEMMEELHSL\DP\RRQELLEARF\TGLGVSKGPLNSESSNQSL 
CSVGSLSDKEVETPEKKQNDQRNRKRKAEPYETSQGKGTPRGHK 
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SEQ 
ID 


Predicted 

beginning 

nucleotide 

location 

corresponding 

residue of 
amino acid 
sequence 


Predicted end 

corresponding 

amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ISDYFERRVEQPLYGLDGSAAKEATEEQSALPTLMSVMLAKPRL 
DTEQLAQRGAGLCFTFVSAQQNSPSSTGSGNTEHSCSSQKQISI 
QKSQT\QSDLTIEKISALENSKNSDLEKKEGRIDDLLRANCDLR 
RQI\DEQQKML,EKYK\ERLNRCFDNEPRNFLIEKSKQEKMACRD 
KSMQDRLRLGHFTTVRHGAS FTEQWTDGYAFQNLI KQQERINS Q 
REEIERQP.KMIAKRKPPAMGQAPPATNEQKQRKSKTNGAENETL 
TLAEYHEQEEIFKLRLGHLKKEEAEIQAELERLERVRNLHIREL 
KRIHNEDNSQFKDHPTLNDRYLLLHLLGRGGFSEVYKAFDLTEQ 
RYVAVKIHQLNKNWRDEKKENYHKHACREYRIHKELDHPRIVKL 
YDYFSLDTDSFCTVLEYCEGNDLDFYLKQHKLMSEKEARSIIMQ 
I VNALKYLNE IKPPII HYDLKPGIII LLVNGTACGE I KITDFGLS 
KIMDDDSYNSVDGMELTSQGAGTYWYLPPECFWGKEPPXISNK 
VDVWS VGVI FY QCLYGRKP FGHNQS QQD I LQEMTILKATEVQFP 
PKPWTPEAKAFIRRCLAYRKEDRIDVQQLACDPYLLPHIRKSV 
STSSPAGAAIASTSGASNNSSSN 


5996 


-612 


981 


DQQACLLGLMLTLEFGILEFDPSWIG3WTQR/SWVSWRSRPGCE 
LFSIWFGSIVNEGYLNSASEGEEFCIYNRI'TPNACSYGVAVGVL 
AFLTCLLYLALDVYFPQISSVKDRKK\AVLSGHPWSGEPHPAA 
FWAFLWFTGDSCYL\ANQWQVSKPKDNPLNEGTDASPGRPSPFS 
FFSIFTWSLTAALAVP.RFKDLSFQEEYSTLFP\ASAQP 


5997 


1612 


981 


BQQACLLGLMl.Tl.EFGTT.EFDPSWIGSWTQR/SWVSWRSRPGCE 
LFS I WFGS I VNEG YLNSASEGEE FC I YNRNPNACS YGVAVQVL 
AFLTCLLYLALDVYFPQISSVKDRKK\AVLSGHPWSGEPHPAA 
FWAFLWFTGDSCYL\ANQWQVSKPKDNPLNEGTDASPGRPSPFS 
FFSIFTWS LTAALAVRRFKDLSFQEE YSTLFP \ AS AQ P 


5998 


1612 


981 


I LLC-LMLTLE F 1 1 fit 3 E 
LFSIWFGSIVKEGYLNSASEGEEFCIYNRNPNACSYGVAVGVL 
AFLTCLLYLALDVYFPQISSVKDRKK\AVLSGHPWSGEPHPAA 
FWAFLWFTGDSCYL\ANQWQVSKPKDNPLNEGTDASPGRPSPFS 
FFS I FTWSLTAALAVRRFKDLSFQEE YSTLFP \ASAQP 


5999 


2 


1790 


rppmekarrggdgvprgpvlhiwvgfhhkkgcqvefsypplip 
gdghdshtlpeewkylpflalpdgahnyqedtvffhlpprngng 
atvfg i scyr\ ql eakalkvrqad itretvqks vcvlsklplyg 
llqaklqlithayfeekdfsqisilkelyehmnsslggaslegs 
qvylglsprdlvlhfrhkglilfklillekkvlfyispvnklvg 
almtvlslfpgmiehglsdcsqyrprksmsedgglqesnpcadd 
fvsastadvshtnlgtirkmagnhgedaamkteeplfqvedss 
kgqepndtnqylkppsrpspdssesdwetldpsvledpnlkere 
qlgsdqtnlfpkdsvpseslpitvqpqantgqwlipglisgle 
edqyghplaiftkgylclpymalqqhhllsdvtvrgfvagatni 
lfrqqkhlsdaiveveealiqihdpelrkllnpttadlrfadyl 
vrhvtenrddvfldgtgweggdewiraqfavyihallaatlqlv 
lfri vkvakki gnvmvtt\ srnwqtgk\avgqsvggafs \ sak 
ta\n:sswlstfttstsqslte?pdekp 


6000 


101 


1561 


TEPCRTAENCTATMSENNKNSLESSLRQLKCHFTWNLMEGENSL 
DDFEDKVFYRTEFQNREFKATMCNLLAYLKHLKGQNEAALECLR 
KAEELIQQEHADQAEIRSLVTWGNYAWVYYHMGRLSDVQIYVDK 

ALEKKPKNPEFTSGLAIASYRLDNWPPSQNAIDPLRQAIRLNPD 
NQYLKVLLALKLHKMREEGEEEGEGEK\LVEEALEKAPG\VrDV 
LRSAA\KFYRGKDEPDKAIELLKKALEYIP\NNAYLHCQIGCCY 
RAKVFQVMMLRENGMYGKRKLLELIGHAVAHLKKADEANDNLFR 
VCSILASLHALADQYEDAEYYFQKEFSKELTPVAKQLLHLRYGN 
FQLYQMKCEDKAIHHFIEGVKINQKSREKEKMKDKLQKIAKMRL 
SKNGADSEALHVLAFLQELNEKMQQADFDSERGLESGSLIPSAS 
SWNGE 


6001 


176 


1038 


AFAKSPSRGHRHTHIHTPRHTPRCTMAESHLQSSLITASQFFEI 
WLHFDADGSGYLEGKELQNL1QELQQARKKAGLELSPEMKTFVD 
QYGQRDDGKIGIVELAHVLPTEENFLLLFRCQQLKSCE\EFMKT 
WRKYDTDHSGFIETEELKNFLKDLLEKANKTVDDTKLAEYTDLM 
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ID 
NO: 


beginning 
corresponding 

sequence 


nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Histidine, I=Iscleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=3erine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LK_iFDSNMDGKLELTEMARLLPVQENFLLKFQGIKMCGKEFNKA 
FELYDQDGNGYIDENELDALLKDLCEKNKQDLDINNITTYKKNI 
MALSDGGKLYRTDLALILCAGDN 


S002 


977 


81 


LAPPGGGLHIPPRTPLSHSRPPPSHHAPHPSPLPLPPADLHPHS 
SMAQRSDLLELDCQLTRDRVVVVSHDENLCRQSGLNRDV3SLDF 
EBLPLYKEKLHVYFSPGHFAHGSDRRMVRLEDLFQRFPRTPMSV 
EIKGKNEELIREC/VLVRRYDRNEITIWASEKSSVMKKCKAANP 
EMPLSFTISRGFWVLLSYYLGLLPFIPIPEK7FFCFLPNIINRT 
YFPFSCSCLNQLLAWSKWLIMRKSLIRHLEERGVQWFMCLNE 
ESDFEAAFSVGATGVITDYPTALRHYLDNHGPAARTS 


6003 


140 


4098 


GKLRAFRGMRRLICKRICDYKSFDDEESVDGNRPSSAASAFKVP 
APKTSGNPANSARKPGSAGG P KVGAGAS KEGGAGAVDEDDF I KA 
FTDVPSIQIYSSRELEETLNKIREILSDDKHDWDQRAWALKKIR 
SLLVAGAAQYDCFFQHLRLLDGALKLSAKDLRSQWREACITVA 
HLSTVLGNKFDHGAEAIVPTLFNLVPNSAKVMATSGCAAIRFII 
RHTHVPRL I PLITSNCTS KSVPVRRRS FE FLDLLLQEWQTHSLE 
RHAAVLVE T I KKG I HDADAE ARVE ARKT YMGLRNH FPGEAETLY 
NSLEPSYQKSLQTYLKSSGSVASLPQSDRSSSSSQESLNRPFSS 
KWSTANPSTVAGRVSAGSSKASSLPGSLQRSRSDIDVNAAAGAK 
AHHAAGQSVRSGRLGAGALNAGSYASLEDTSDKLDGTASEDGRV 
RAKLSAPLAGMGNAKADSRGRSRTKMVSQ53QPGSRSGSPGRVLT 
TTALSTVSSGVQRVLVNSASAQKRSKIPRSQGCSREASPSRLSV 
ARSSRIPRPSVSQGCSREASRESSRDTSFVRSFQPLASRHHSRS 
TGALYAPEVYGASGPC-YGISQSSRLSSSVSAMRVLNTGSDVEEA 
VADALLLQDIRTKKKPARRRYESYGMHSDDDANSDASSACSERS 
YSS?.NGSIPTYMRQT\EDV\AEVLNRCASSNWSERKEGLLGLQN 
LLKNQRTLSRVELKRLCEIFTRMFADPHGKRVFSMFLETLVDFI 
2VEKDDLQDWLFVLLTQLLKKMGADLLGSVQAKVQKALDVTRES 
FPNDLQFNILMRFTVDQTQTPSLKVKVAILKYIETLAKQMDPGD 

fikssetrlavsrvitwttepkssdvrkaaqsvlislfelntpe 
ftmllgalpktfqdgatkllhnhlrntgngtqssmgspltrptp 
rseanwsspltsptntsqntl3psafdydtekmnsediysslrg 
vteaiqnfsfrsqedmneplkrdskkddgdsmcggpg\msdpra 
ggdatdssqtal\dnkasllhsmpthssprsrdynpynysdsis 
pfnksalkeakfdddadqfpddlsldhsdlvaellkelsnhwer 
veerkialyelmkltqeesfsvwdehfktillllletlgdkept 
iraiaalkvlreilrhqparfknyaeltvmktleahkdphkevvr 
saeeaasv\lats-\speqcikvlcpiiqtadypimlaaikmqt 
kviervsketlnlllpeimpgliqgydnsessvrkacvfclvav 
havigdelkphlsqltgskmkllnlyikraqtgsggadpttdvs 

GQS 


6004 




4098 


GKLRAFRGMRRLICKRICDYKSFDDEESVDGNRPSSAASAFKVP 
APKTSGNPANSARKPGSAC-GPKVGAGASK3GGAGAVDEDDFIKA 
FTDVPSIQIYSSRELEETLNKIREILSDDXHDWDQRANALKKIR 
SLLl'AGAAQYDCFFQHLRLLDGALKLSAKDLRSQWREACITVA 
HLiS rVLGNKFDHGAEAIVPTLFHLVPlvSAKVMATSGCAAIRFII 
RHTHVPRLIPLITSNCTSKSVPVRRRS FE FLDLLLQEWQTHSLE 
RHAWLVETIKKGIHDADAEARVEARKTYMGLRNHFPGEAETLY 
NSLEI-'SYQKSLQTYLKSSGSVASLPQSDRSSSSSQESLNRPFSS 
CWSl JIPSTVAGRVSAGSSKASSLPGSLQRSRSDIDWAAAGAK 
AHHAAGQSVRSGRLGAGALNAGSYASLEDTSDKLDGTASEDGRV 
RAKLSAPLAGMGNAKADSRGRSRTKMVSQSQPGSRSGSPGRVLT 
TTA1STVSSGVQRVLVNSASAQKRSKIPRSQGCSREASPSRLSV 
ARSSRIPRPSVSQC-CSREASRESSRDTSPVRSFQPLA3RHHSRS 
TGAV/APEVYGASGPGYGISQSSRLSSSVSAMRVLNTGSDVEEA 
VADALLLGDIRTKKKPARRRYESYGMHSDDDANSBASSACSERS 
YSSRNGS I PTYMRQT\EDV\AEVLNRCASSUWSERKEGLLGLQN 
LLKNQRTLSRVELKRLCEIFTRMFADPHGKRVFSMFLETLVDFI 
QVHKDDLQDWLFVLLTQLLKKMGADLLGSVQAKVQKALDVTRES 
F PNDLQFN I LMRFTVDQTQTPS LKVKVAI LKYIETLAKQMDPGD 
FINSSETRLAVSRVITWTTEPKSSDVRKAAQSVLISLFELNTPE 
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S3Q 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

residue of 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidins, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 

\=possible nucleotide insertion) 








ftkllgalpktfqdgatkllhnhlrntgitfgtqssmgspltrptp' 
rs panws s plts ptntsqntls psafdydtenmnsedi yss lrg 
vteaiqnfsfrsqedmneplkrdskkddgdsmcggpg\msdpra 
ggdatdssqtal\dnkasllhsm?thssprs3dynpynysdsis 
pfnksalkeamfdddadqfpddlsldhsdlvae1lkelsnhker 
veerkialye1mkltqeesfsvwdehfktil1llletlgdkept 
iralalkvlreilrhqparfkjiyaeltvmktleahkdphkewr 
saeeaas v\ lats i \ speqc i kvlcpi i qtad ypinlaai kmqt 
kviervsb:etlnlllpeimpgliqgydnsessvrkacvfclvav 
havic3delkphlsqltgskmkllnlyikraqtgsggadpttdvs 

GQS 


S005 


133 


5955 


RSSGRRQEQLGQFPGRERKGMASGLGSPSPCSAGSEEEDMDALL 
NNSLPPPHPENEEDPEEDLSETETPKLKKKKKPKKPRDPKIPKS 
KRQKKERMLLCRQLGDSSGEGPEFVEEEEEVALRSDSEGSDYTP 
GKKKKKKLGPKKEKKSKSKRKEEEEEDDDDDDDSKEPKSSAQLL 
EDWGMEDIDHVFSEEDYRTLTNYKAFSQFVRPLIAAKNPKIAVS 
KMMMVLGAKWREFSTNNPFKGSSGASVAAAAAAAVAWESMVTA 
TEVAPPPPPVEVPIRKAKTKEGKGPNARRKPKGSPRVPDAKKPK 
PKKVAPLKIKLGGFGSKRKRSSSEDDDLDVESDFDDASINSYSV 
SDGSTSRSSRSRKKLRTTKKKKKGEEEVTAVDGYETDHQDYCEV 
CQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGI 
QWEAKEDNSEGEEILEEVGGDLEEEDDHKMEFCRVCKDGGELLC 
CDTCPSSYHIHCLNPPLPEIPNGEWLCPRCTCPALKC-KVQKILI 
V.lJi.'jr.PFSPTPVPRPPDADPNTPSPKPLEGRPERQFFVKWQGME 
YWHCSWVSELQLELHC\QVMFRNYQRKNDMDEPPSGDFGGDEEK 
S\RKRKNKDPKFAEMEERFYRYGIKP3W\MMIHRILNHSVDKKG 
HVHYL I KWRDLP YDQAS WESEDVE I QDYDLFKQS YWNHRELMRG 
EEGRPGKKLKKVKLRKLERPPETPTVDPTVKYERQPEYLDATGG 
TLHPYQMEGLNWLRFSWAQGTDTILADEKGLGKTVQTAVFLYSL 
YKEGHSKGPFLVSAPLSTIIN\WERE?EMWAPDMYV\VTYVGDK 
DSPAIIRENEFS\FEDNAIRGGKKASRMKKEASVKFHVLLTSYE 
LITIDP/1AILGSIDWACLIVDEAHRLKNNQSKFFRVLNGYSLQHK 
LLLTGTPLQNNLE3LFHLLNFLTPERFHNLEGFLEEFADIAKED 
LHDMLG\PHMLRRLKADVFKNMPSKTELIV\RVELSPM\Q 
KKYYK\YILHSKFLKALN\ARGGGNQVSLLNWMDLKKCCNHPY 
LFPVAAMEAPraPNGMYDGSALIRASGKLLLLQKMLKNLKEGGH 
RVL I FSQMTKMLDLLEDFLEHEGYKYERIDGGITGNMRQEAI DR 
FNAPGAQQFCFLLSTRAGGIGTNLATADTVIIYDSDWNPHNDIQ 
AFSRAHRI3QNKKVMIYRFVTRASVEERITQVAKKKMMLTHLW 
RPC-LGSKT3SMSKQELDDILKFGTEELFKDEATDGGGDNKEGED 
SSVIHYDDB^IERLLDRNQDETEDTELQGMNEYLSSFKVAQYVV 
REEEMGEEEEVEREIIKQEESVDFDYWEKLLRHHYEQQQEDLAR 
NLGKGKRIRKQVNYNDGSQEDRDWQDDQSDNQSDYSVASEEGDE 
DFEERSEAPRRPSRKGLRNDKDKPLPPLLARVGGNIEVLGFNAR 

qrkaflnaimrygmppqdafttqwlvrdlrgksekefkayvslf 
krhlcepgadgaetfadgvpreglsrqhvltrigvmslirkkvq 
efehvngrwsmpelaeveenkkmsqpgspspktptpstpgdtqp 
ntpapvppaedgikieenslkeeesiegekevkstapetaiect 

aadvekveeksaidltpiwedkeekkeeeekkevmlqngetpk 
dlmdekqk:<nikqrfmfniadggftelhslwqneeraatvtkkt 
yeiwhrrhdywllagiinhgyarwqdiqndpryailnepfkgem 
krjwflbiknkflarrfklleqalvieeqlrraaylnmsedpsh 
psmalntrfaeveclaeshqhlskesmagnkpanavlhkvlkql 
eellsdmkadvtrlpatiarippvavrlqmsernilsrlanrap 
eptpqqvaqqq 


6006 


1 


965 


DND] 1 RK I7HRHE P P VTAE P I RLLAENED WWD KP S S IPVHP C 
GRFRHKTVIFILGKEHQLKELHPLHRLDRLTSGVLMFAKTAAVS 
ERIHEQVRDRQLEKEYVCRVEGEFPTEEVTCKEPILVVSYKVGV 
CRTOPRGKPCETVFQRLSYNGQSSWRCRPLTGRTHQrRVHLQF 
LGHPILNDPIYNSVAWGPSRGRGGYIPKTKEELLRDLVAEHQAK 
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Predicted 

beginning 

nucleotide 

location 

corresponding 



Predicted end 



Amino acid segment containing signal peptide" 
!A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
:leotide insertion) 



QSLDVLDLCEGDLSPC3LTDSTAPSSELGKDDLEELAAAA\QKME~ 
EVAEAAPQELDTIAIASEKAVETDVMNQ\RQT\TLCRVPAGATG 
SLAPRPCDVPTCPTL 



helgqve yvftdktgtltenemqfrecs ingmkyqe ingrlvpe '" 
gptpdssegnlsylsslshlnnlshlttsssfrtspenetelik 
ehdlpfkavslchtvqin1cvqtdctgdgpwqswlapsqleyyas 
spdekalveaaariglvfignseetmevktlgklerykijjhile 
fdsdrrrmsvivqapsgekllfakgaessilpkciggeiektri 
hvdefalkglrtlciayrkftskeyeeidkr:feartalqqr\e 
eklaavfqfiekdlillgatavedrlqdkvretiealrmagikv 
wvltgdkhetavsvsl2cghfhrtmnilelinqk3dsecaeqlr 
qlarritedhviqhglwdgtslslalreheklfmevcrncsav 
lccrmaplqkakvirl1kispekpitlavgdgandvsmiqeahv 
gigimgkegrqaarnsdyaiarfkflskllfvhghfyyiriatl 
vqyffyknvcfitpqflyqfyclfsqqtlydsvyltly\nicft 
slpiliyslleqhvdfhvlqnkptlyrdisknrllsiktflywt 
ilgfshafifffgsylligkdtsllgmgqmfgnwtfgtlvftvm 
vitvtvkmalethfwtwinhlvtwgsiifyfvfslfyggilwpf 
lgsqnmyfvf iqllssgsawfai ilmwtclfldi ikkvfdrhl 
mptstekaqltetnagikcldsmccfpegeaacasvgrmlerv] 
grcspthisrsws asdpfytndrk iltlstmdsstc 



AGVRRAGARRGPGRALPAGATAVPPPSARRRRRCPAL'blHAGI'AW 
ASRPSQETMFOLl^/WNLGSLRJWRKTVKKILSDIGLEYCKEHIE 
3FKQFEPNDFYLKNTTWEDVGLWDPSLTKNQDYRTKPFCCSACP 
FSSKFFSAYKSHFRNVHSEDFENRILI.NCPYCTFIIADKKTLETH 
IKIFHAPNASAPSSSLSTFKDKNKNDGLKPKQADSVEQAVYYCK 
KCTYRDPLYEIVRKHIYREHFQHVAAPYIAKAGEKSLNGAVPLG 
SNAREESSIHCKRCLFMPKSYEALVQHVIEDHERIGYQVTAMIG 
HTNVWPRSKPLMLIAPKPQDKKSMGLPPRIGSLASGNV\RSLP 
SQQMVNRLSIPKPIJLNSTGVNMMSSVHLQQNNYGVKSVGQGYSV 
GQSMRLGLGGNAPVSIPQQSQSVKQLLPSGNGRSYGLGSEQRSQ 
APARYSLQSANASSLSSGQLKSPSLSQSQASRVLGQSSSKPAAA 
ATGPPPGNTSSTQKWKICTICNELFPENVYSVHFEKEHKAEKVP 
AVANYIMKIHNFTSKCLYCNRYLPTDTLLNHMLIHGLSCPYCRS 
TFNDVEKMAAHMRMVHIDEEMGPKTDSTLSFDLTLQQGSHTNIH 
LLVXTYNLRDAPAESVAYHAQNWPPVPPKPQPKVQEKADIPVKS 
SPQAAVPYKKDVGKTLCPLCFSILKGPISDALAHHLRERHQVIQ 
TVHPVEKKLTYKCIHCLGVYTSNMTASTITLHLVECRGVGKTQN 
GQDKTNAPSRLNQS PS LAPVKRT YEQME FPLLKKRKLDDDSDS P 
SFFEEKPEEPWLALDPKGH\EDDSYEARKSFLTKYFT\KQPYP 
TRREIEKLAASLWV\WK\SDIASHFSNKRKKCVRDCEKYKPGVL 
LGFNMKELtlKVKHEMDFDAEGLFENHDEKDSRVNASKTADKKLN 
LGKEDDSSSDSFENLEEESNESGSPFDPVFEVEPKISKDNPEEH 
VLKVIPEDASESEEKLDQKEDGSKYETIHLTEEPTKLMHNASDS 
EVDQDDWEWKDGASPSESGPGSQQVSDFEDNTCEMKPGTWSDE 



chglqhltpfrelnlslqg*eph*aa*qavrseeksic*gspsc 
klvlgvlvpvarqsshsagpaqsafr*tgtgsgtpkaaeqsgyw 
eaytlghqhwnmfpiqrpplvmkgrrimcgkcekg*vsdsvtgg 
ravageqasqrrtvftagggeclgaksvrasvftgnqpgvmgll 
ngkrggcfesgylfgfivigkiqsleakvplpvngqtgeraspg 
kcr:hivdavc*sehh*dhflaaaflenstiis*vapgswqdha 
vlqkevqasvrcrgfesvdtapagfwahsppglqgeptttsvsl 
fvlapqdgegvpfvegqlvtvlglwpqsirhtfvhhtqlflhp 
i*klgaldvaflhlltlvcssfnvayg*gknggttlhqlfaevn 
avtrgsavqrrps iti ss ihvdtkiqqelhdvmvagadgwqwg 
dpfwglagifhliddplhqielsfqrrv*eqcqgvkpdsqpvp 
rplrvgllqvgplvrgggrrvagrgkrcwrdllfpwrwglshrt 
rdllrggdrghvvvivlcrlgslvgglgtdellwfggr*lii ig 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

residue of 
sequence 


Predicted end 

corresponding 
to first 

sequence 


Am^no acid segment containing signal peptide 
(A=AIanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion! 








1**1 J.-^SGEWGCGLGRGELFQVSIGIGVS I VH I GQGDHEVLGG 
AGLVERGALHATGOGVEALVQQLLDVGPAGALGLCDGAALFQGP 
GPA'GQLPAEGLQVGITLVAQWRMHDGRELGGAF.WPWQALHGAAI 
CGVGGAILLKALSQYFLKGG*RLWCARGQ*PVKKRQRRWRG*TR 
R*NGLTIHCFN*LI*GAVCCRLVILRWCGLLEVHGVYGT*IHCL 
GSFPGRLWP*PFISQERPNGHCQWEFRLAVPSWKCRWSRWRVRG 
TWRYGNPLLNLL*GANLGGAACGGQQGGPLSTWQACTGPGQAAF 
LPPFQGACRPRTQRCRTWVCPIAWRQLLAYTED 


6010 




3533 


IMPCGSSRLLRGCWTHPNEPVSDLSYFDCIESVMENSKVLGESM 
AGISQNAKTGDLPAFGECVGIAS KALCGLTEAAAQAAYLVGIFD 
PNSQAGHQGLVDPIQFARANQAIQMACQNLVDPG3SPSQVLSAA 
TIVAKHTSALCNACRIASSKTANPVAKRHFVQSAKEVANSTANL 
VKTI KALDGDFS EDNRNKCRI ATAPL I EAVEMLTAFASNPEFVS 
IPAQISSEGSQAQEPILVSAKPMLESSSYLIRTARSLAINPKDP 
PTWSVLAGHSHTVSDSIKSLITSIRDKAPGQRECDYSIDGINRC 
IRDIEQASLAAVSQSLATRDDISVEALQEQLTSWQEIGHLIDP 
IATAARGEAAQLGHKGTQLASYFEPLILAAVGVASKILDHQQQM 
TVLDQTKTLAESALQMLYAAKEGGGNPKAQHTHDAITEAAQLMK 
EAVDDIMVTTiNEAASEVGLVGGMVDAIAEAMSKLDEGTPPEPKG 
TFVDYQTTWKYSKAIAVTAQEMMTKSVTNPEELGGLASQMTSD 
YGHLAFQGQMAAATAEPEEIGFQIRTRVQDLGHGCIFLVQKAG\ 
.MCVCPTDSYTKRELIECARAVTEKVSLVLGALQAGNKGTQACI 
TAATAVSGIIADLDTTIMFATAGTLNAENSETFADHRENILKTA 
KALVEDTKLLVSGAAST PDKLAQAAQ S SAAT I TQLAE WKLGAA 
3LGSDDPETQVVLINAIKDVAKALSDLISATKGAASKPVDDPSM 
YQLKGAAKVMVTNI/TSLLF^TVKAVEDKA'rHfrrRAIjEATIECIKQ 
3LTVFQSKDVPEKTSSPEESIRMTKG1TMATAKAVAA3N5CRQE 
DVIATANLSRKAVSDMLTACKQASFHPDVSDEVRTRALRFGTEC 
TLGYLDLLEHVLVILQKPTPELKQQLAAFSKRVAGAVTELIQAA 
EAMKGTEWVDPEDPTV1AETELLGAAAS I EAAAKKLE2LKPRAK 
PKQADETLDFEEQILEAAKSIAAATSALVKSASAAQRELVAQGK 
VGSIPANAADDGQWSQ3LISAARMVAAATSSLCEAANASVQGHA 
SEEKLISSAKQVAASTAQLLVACKVKADQDSEAMRRLQAAGNAV 
KRASDNLVRAAQKAAFGKADDDDVWKTKFVGGIAQIIAAQEEM 
1KKERELEEARKKLAQIRQQQYKFLPTELREDEG 


6011 


446 


1S35 


LLQFAMRKSPGLSDCLWAWILLLSTLTGRSYGQPSLQDELKDNT 
TVFTRILDRLLDGYDNRLRPGLGERVTEVKTDIFVTSFGPVSDH 
DMEYTIDVFFRQSWKDERLKFKGPMTVLRLNNLMASKIWTPDiF 
FHNGKKSVAHNMTMPNKLLRITEDGTLLYTMRLTVR\AECPMAF 
GRDFPM\D\AHACPLKFGSYAYTRAEWY3WTREPARSWVAED 
GSRLNQYDLLGQTVDSGIVQSSTGEYVVMTTHFHLKRKIGYFVI 
QTYLPCIMTVILSQVSFWLNRESVPARTVFGVTTVLTMTTLSIS 
ARNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGYA 
WDGKSWPEKPKKVKDPLIKKNNTYAPTATSYTPNLARGDPGLA 
T1AKSATIEPKEVKPETKPPEPKKTFNSVSKIDRLSRIAFPLLF 
GIFNLVYWATYLNREPQLKAPTPH2 


S012 


351 


5013 


PAELFQSFA-WHKELYDWRLGPWNQCQPVISKSLEKPLECIKGE 
EG I Q VRE I AC I QKDKD I PAED IICEYFEP KPLLEQACL I PCQQD 
CIVSEFSAWSECSKTCGSGLQHRTRHWAPPQFGGSGCPNLTEF 
QVCQS S P CEAE ELRYS LHVGPWS TCSMPHSRQVRQARRRGKNKE 
REKDRSKGVKDPEARELIKKKRNRNRQNRQENKYWDIQIGYQTR 
EVMCINKTGKAADLSFCQQEKLPMTFCSCVITKECQVSEWSEWS 
PCSKTCHDMVSPAGTRVRTRTIRQFPIGSEKECPEFEEKEPCLS 
QGDGWPCATYGWRTTEWTECRVDPLLSQQDKRRGWQTALCGGG 
IQTREVYCVQANENLLSQLSTHKNKEASKPMDLKLCTGPIPNTT 
QLCHIPCPTECEVSPWSAWGPCTYENCNDQQGKKGFKLRKRRIT 
MEPTGGSGVTGMCEHLLEA1PCEEPACYDWKAVRLGDCEPDNGK 
ECGPGTQVQEWCINSDGEEVDRQLCRDAIFPIPVACDAPCPKD 
CVLSTWSTWSSCSHTCSGKTTEGKQIRARSILAYAGEEGGIRCP 
NSSALQEVRSCNEHPCTVYHWQTGPWGQCIEDTSVSSFNTTTTW 
NGEASCSVGMQTRKVICVRVNVGQVGPKKCPESLRPETVRPCLL 
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SEQ 
ID 


Predicted 
beginning 

corresponding 

amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspar;ic Acid", E = 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion! 








pckkdcivtpysdwtscps\sckegdssirkqsrhrviiqlean 
ggrdctdplyeekaceapqacqsyrw\kthkw\hrcq\lvp\ws 
vqqdsp\gaqegcgpgrqaraitcrkqdggqagiheclqyagpv 
paltqacq i pcqddcqlts ws kfss cngdcgavrtrkrtlvgks 
kkkekcknshlypliei'qycpcdkynaqpvgnwsdcilpegkve 
vllgmkvqgdikecgqgyryqamacydqngrlvetsrcnshgyi 
eeaciipcpsdcklsewsnwsrcskscgsgvkvrskwlrekpyn 
ggrpcpkldhvnqaqvyevvpchsdcnqylwvtepws i ckvtfv 
nmrencgegvqtrkvrcmqntadgpsehvedylcdpeemplgsr 
vcklpcpedcvisewgpwtqcvlpcnqssfrqrsadpirqpade 
grscpnavekepcnlnkncyhydynvtdwstcqlsekavcgngi 
ktrml3cvrsdgksvdlkycealgleknwqmntscmvecpvncq 
lsdwspwsecsqtcgltgxmirrrtvtqpfqgdgrpcpslmdqs 
kpcpvkpcyrwqy3qwspcqvqeaqcgegtrtrniscvvsdgsa 
ddfskvvdeefcadibliidgnknmvleescsqpcpgdcyijkdw 
sswslcqltcvngedlgfggiqvrsrpviiqelenqelcpeqml 
etkscydgqcyeykwmasawkgssrtvwcqrsdginvtggclvm 
sqpdadrscnppcsqphsycsetktchceegytevmssnstleq 
ctlipvwlptmedkrgdvktsravhptqpssisrpagrgrtwflq 
pfgpdgrlktwvy3vaagafvllifivsmiylackkpkkpqrrq 


6013 


11S1 


710 


GAFIAGVFVQPVLIR^PNSLDTTSWAWRCiGVl K ,'LV- -'lASOTC 
S I VDVE FLPVYHP S PE E S RDPTL YANNVQRVMAQALG I PATECE 
FVGSLPVIWGRLKVALEPQL/WGTGKSASEGWAVRKLCGRWGR 
ARPESNDQPGRVCQftATAL 


6014 


2857 


613 


E.avAGGMEKSRMNLPKGPDTT,CFnKDEFlv:KEDFDTOHFVSI)rHK 
RVQLEELRDDLELYYKLLKTAMVELINKDYADF\VNLSTNLV3M 
DKALNQLSVPLGQLREEVLSLRSSVSEGIRAVDERMSKQEDIRK 
KKMCVLRLIQVIRSVEKIEKILNSQSSKETSALEASSPLLTGQI 
LERIATEFNQLQFHACQSK\GMPLLDKVRPRIAGITAMLQQSLE 
3LLLEGLQTSDVDIIRHCLRTYATIDKTRDAEALVGQVLVKPYI 
DEVIIEQFVESHPMGLQVMYNKLLEFVPHHCRLLREVTGGAISS 
EKGNTVPGYDFLVNSVWPQIVQGLEEKLPSLFNPGNPDAFHEKY 
TISMDFVRRLERQCGSQASVKRLRAHPAYHSFWKKWNLPVYFQI 
RFREIAGSLEAALTDVLEDAPAESPYCLLASHRTWSSLRRCHSD 
EMFLPLLVHRLWRLHSGRFWARYSVFV\N\ELSLRPISNESPKE 
IKKPLVTGSKEPS ITQGNTED2GSGPSETKPWS ISRTQLVYW 
ADLDKLQEQLPELLEIIKPKLEMIGFKNFSSISAALEDSQSSFS 
ACVPSLSSKIIQDLSDSCFGFLKSALEVPRLYRRTNKEVPTTAS 
SYVDSALKPLFQLQSGHKDKLKQAIIQQWLEGTLSESTHKYYET 
VSDVLNS VKKME E S LKRLKQARKTT PANPVGP SGGMSDDD K I RL 
QLALDVEYLGEQ I Q KLGLQAS D I KS FSALAELVAAAKDQATAEQ 
P 


6015 


13 


2237 


A3GCAERRGTEPWELSMSWESGAGPGLGSQGMDLVWSAWYGKC 
VKGKGSLPLSAHGIWAWLSRAEWDQVTVYLFCDDHKLQRYALN 
R ITVWRSRSGNELPLAVAS TADL IRCKLLDVTGGLGTDELRLLY 
GMALVRFVNLISERKTKFAKVPLKCLAQEVNIPDWIVDLRHELT 
HKKMPHIWDCRRGCYFVLDWLQKTYWCRQLENSLRETWELEEFR 
SGIEEEDQEEDIQMIWDDITEQKPEPQDD3KSTESDVKADGDSK 
GSEEVDSHCKKALSHKELYERARELLVSYEEEQFTVLEKFRYLP 
XAIKAWNNPSPRVECVLAELKGVTCENREAVLDAFLDDGFLVPT 
FEQLAALQIEYEENVDLNDVLVPKPFSQFWQPLLRGLHSQNFTQ 
ALLERMLSELPALGISGIRPTYILRWTVELIVANTKTGRNARRF 
SAGQWEARRGKRLFNCSAS LDWPRMVES CLGS PCWAS PQLLRI I 
?\KAMGQGLODE\EQEKLLRICSIYTQSGENSLVQEGSEASPIG 
KSPYTLDSLYWSVKPASSSFGSEAKAQQQEEQGSVWDVKEEEKE 
EKEVLPDQVEEEEENDDQEEEEEDEDDEDDEEEDRMEVGPFSTG 
QESPTAENARLLAQKRGALQGSAWQVSSEDVRWDTFP\LGRMPR 
SRPRTPAELMLENYDTHVI FWTKPVL\EQRLEPSTCK\TDTLGL 
\3CC-VGS\GKCSNSSSSNFRGAFLLEARGSLH\GL\KTGLQLF 


£016. 


13 


2237 


AEGCAERRGTEPWELSKSKESGAGPGLGSQGMDLVWSAWYGKC 
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SEQ 
ID 


Predicted 

nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine / K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosina, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








VKGKGSLPLSAHGIWAWLSRAEWDQVTVYLFCDDHKLQRYALN 
R I TVWRS RS GNELP LAVAS TADL I RCKLLDVTGGLGTDE LRLL Y 
GMALVRFVNLISBRKTKFAKVPLXCIAQEVNIPDWIVDLRHELT 
HKKKPIIINDCRRGCYFVLDWLQKTYWCRQLENSLRETWELEEFR 
EGIEEEDQEEDKNIWDDITEQKPEPQDDGKSTBSDVKADGDSK 
GSEEVDSHCKKALSHKELYERARELLVSYEEEQFTVLEKFRYLP 
KAIKAWNNPSPRVECVLAELKGVTCENEEAVLDAFLDDGFLVPT 
FEQLAALQIEYEENVDLKTDVLVPKPFSQFWQPLLRGLHSQNFTQ 
ALLERMLSELPALG1SGIRPTYILRWTVELIVANTKTGRNARRF 
SAGQWEARRGWRLFNCSASLDWPRMVESCLGSPCWASPQLLRII 
F\KAMGQGLQDE\EQEKLLRICSIYTQSGENSLVQEGSEASPIG 
KSPYTLDSLYWSVKPASSSFGSEAKAQQQEEQGSVNDVKEEEKE 
EKEVLPDQVEEEEENDDQEEEEEDEDDEDDEEEDRMEVGPFSTG 
QESPTAENARLLAQKRGALQC-SAWQVSSEDVRWDTFP\LGRMPR 
SRPRTPAELMLENYDTHVIFWTKPVL\EQRLEPSTCK\TDTLGL 
\SCGVGS\GNCSNSSSSNFRGAFLLEARGSLH\GL\KTGLQLF 


6017 


203 


3469 


NFALQTMEPALPMPPVEELDVMFSELVDELDLTDKHREAMFALP 
AEKKWQIYCSKKKDQEENKGATSWPEFYIDQLKSMAARKSLLAL 
EKEEEEERSKTIESLKTALRTKPMRFVTRFIDLDGLSCILNFLK 
TMDYETSESR IHTSL I GC I KALMNN SQGRAHVLAHS ES INVIAQ 
SLSTENIKTKVAVLEILGAVCLVPGGHKKVLQAMLHYQKYASER 
TRFQTLINDLDKSTGRYRDEVSLKTAIMSFINAVLSQGAGVESL 
DFRLHLRYE \ FLMLG I HP VMDKLRKH3NSTLDRHLDFFEMLRNE 
DELEFAKRFELVHIDTKSATQMFELTRKRLTHSEAYPHFMSILH 
Hr.'r,Qr4?YKRSGNTVQYWLLLDRIIQQIVIQWDKGQDPDSTPLEN 
- /RMLVN 3WKE EKKRKEHMELOC LEKKEREC 
DAKTQEKEEMMQTLNKMKEKLEKETTEHKQVKQQVAELTAQLHE 
LSRRAVCASIPGGPSFGAPGGPFPSSVPGSLLPPPPPPPLPGGM 
LPPPPPPLPPGGPPPPPGPPPLGAIMPPPGAPMGLALKKKSIPQ 
PTNALKSFNWSKLPENKLEGTVWTEIDDTKVFKILDLEDLERTF 
SAYQRQQDFFTOSNSKQKEADAIDDTLSSKLKVKELSVIDGRRA 
QNCNILLSRLKLSNDEIKRAILTMDEQEDLPKDMLEQLLKFVPE 
KSDIDLLEEHKHELDRMAKADRFLFEMSRINHYQQRLQSLYFKK 
KFAERVAEVKPKV2AIRSGSEEVFRSGALKQLLEVVLAFGNYMN 
. , ^IJAYGFKISSLNKIADTKSSIDKNITLLHYLITIVENKYP 
SVLNLNEELRDIPQAAKVNMT3LDKEISTLRSGLKAVETELEYQ 
KSCPPQPGDKFVHWSQFITVASFSFSDVEDLLAEAKDLFTKAV 
KHFGEEAGKIQPDEFFGIFDQFLQAVSEAKQENENMRKKKEEEE 
RRARMEAQLKEQRERERKMRKAKENSEESGEFDDLVSALRSGEV 
FDKDLSKLKRNRKRITNQMTDSSRERPITKLNF 


S018 


13 


2510 


TISQSGGIRRRREAVWFEWNHDFSRLHMYSPPQCVPENTGYTY 
ALSSSYSSDALDFETEHKLDPVFDSPRMSRRSLRLATTACTLGD 
GEAVGADSGTSSAVSLKNRAARTTKQRRSTNKSAFSINHVSRQV 
TSSGVSYGGTVSLQDAVTRRPPVLDESWIREQTTVDHFWGLDDD 
GDLKGGNKAAIQGNGDVGAGAATGHNGFFCSNCNMLS3RKDVLT 
AHPAAPGPVSRVYSRDRNQKCDDCKGKRHLDAHPGRAGTLWHIW 
ACA3YFLLQILRRIGAVGQAVSRTAWSALWLAVVAPGKAASGVF 
WWIjGIGVr/QFVTLISWLWFLLTRCLRNICKFLVLLI^LFLLLG 

lslrgqg\nffsflpvlnwasmhrtqrvddpqdvfkpttsrlkq 
plqgdseafpwhwmsgveqqvaslsgqchhhgenlrelttllqk 
lqarvdqmeggaagpsasvrdavgqppretdfmafhqehevrms 
hled1lgklrekseaiqkeleqtkqktisavgeqllptvehlql 
eldqlkselsswrhvktgcetvdavqervdvqvremvkllfsed 
q;c-glleqllqrfssqfvskgdlqtmlrdlqlqilrnvthhvsv 
tkqlptseawsavseagasgiteaqaraivn3alklysqdktg 
mvi 'ai esgggsilstrcsetyetktalmslfgiplwyfsqspr 

\ J QI EIYPGNCWAFKGSQGYLWRLSMiyilHPAAFTLEHIPKTL 
SPTGNISSAPKDFAVYGLENEYQEEGQLLGQFTYDQDGESLQMF 
QALKRPDDTAFQIVELRIFSNWGHPEYTCLYRFRVHGEPVK 


6019 


2 


1056 


TPNDREPPPQRPPgSRRASHLAQEITSAASLGDQTQILGSLTTA 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 

location 

corresponding 

amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threor.ine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








V. I R SMPG I S S Q I LTNAQGQVI GTL PW WNS AS VAAPAPA ' 
. 1 ,A/TPQLLLNAQGQVIATLASSPLPPPVAVRK\PSTPES 
LLKSEVQPIKPTPTVPQPAWIASPAPAAKPSASAPIPrTCSET 
PTVSQLVSKPHTPSLDEDGIMLEE1REFAKWFKIRRLSLGLTQT 
QVGQALTATEGPAYSQSAICRFEKLDITPKSAQKLKPVLEKWLN 
EAELRNQEGQQNLMEFVGGEPSKKRKRRTSFTPQAIEALNAYFE 
KNPLPTGQE I TE I AKELNYDRE WRVWFCNRRQTLKNTS KLNVF 
QIP 


S020 


4953 


549 


EAIQFEVSIGMYGNKFDTTCKPLASTTQYSRAVFDGNYYYYLPW 
AHTKPW7LTSYWEDISHRLDAVNTLLAMAERLQTNIEALKSGI 
QGKIPANQLAELWLKLIDEVIEDTRYTLPLTEGKAmfTVLDTQI 
RKLRSRSLSQIHEAAVRMRSEATDVKSTLAEIEDWLDKLMQLTE 
EPQNSMPDI I IWMIRGEKRLAYARIPAHQVLYSTSGENASGKYC 
GKTQTIFLKYPQEKN1IGPKVPVELRVNIWLGLSAVEKKFNSFAE 
GTFTVFAEMY3NQALMFGKWGTSGLVGRHKFSDVTGKIKLKREF 
FLPPKGWEWEGENIVDPERSLLTEADAGHTEFTDEVYfiNESRYP 
GGDWKPAEDTYTDAHGDKAASPSELTCPPGWEWEDDAWSYDINR 
AVDEK3WEYGITIPPDHKPKSWVAAEKMYHTHRRRRLVRKRKKD 
LTQTASSTAGAMEELQDQEGWEYASLIGWKFHWKQRSSDTFRRR 
RWRRKMAPSETHGAAAIFKLEGALGADTTEDGDEKSLEKQKHSA 
TTVFGANTPlVSCNFDRDYIYHLRCYVYQARNLLATiDKDSFSDP 
YAHICFLHRSKTTEIIHSTLNPTWDQTIIFDEVEIYGEPQTVLQ 
NPPKVIMELFDNDQVGKDEFLGRSIF3PWKLNSEMDITPKLLW 
HPVMNGDKACGDVLVTAEI,ILRGKDGSNLPILPPQRAPNI,YMVP 
QGIRPWQLTAIEILAWGr.RNMKNFQMASITSPSLWECGGERV 
ESWIKNLKKTPNFPSSVLFMKVFLPKEELYMPPLVIKVIDHRQ 
FGRKPWGQCTIERLDRFRCDPYAGKEDIVPQLKASLLSAPPCR 
DIVIEMEDTKPLLASKCLSSMSTALSKMASPATVHLTEKEEEIV 
DWWSKFYASSGEHEKCGQYIQKGYSKLKIYNCELENVAEFEGLT 
DFSDTFKLYRGKSDENEDPSWGEFKGSFRIYPLPDDPSVPAPP 
RQFRELPDSVPQECTVRIYIVRGLELQPQDNNGLCDPYIKITLG 
KKVIE\DRDHYIPNTLNPVFGRMYELSCYLPQEKDLKISVYDYD 
TFTRDEKVGETIIDLENPF\LSRFG\SHCG\IPEEYCVSGVNTW 
RD£LR\PTQ\LLQNVARFKGFPQPILSEDGSRIRYGGRDYSLDE 
FEANKILHQHLGAPEERLALHILRTQGLVPEHVETRTLHSTFQP 
NIS \RYYLRVI IWNTKDVILDEKS ITGEEMSDIYVKGWIPGNEE 
NKCKTDVHYR5LDGEGNFNWRFVFPFDYLPAEQLCIVAKKEHFW 
= 1 r v ,TK^RIPPR\LIIQIW\DNDKFS\LDDYLGFPRTLTCRHTI 
HFLQKS PGGNC / RGLDMI PDLKAMNPLKAKTASLFEQKSMKGWW 
PCYAEKDGARVMAGKVEMTLEILNEKEADERPAGKGRDEPNMWP 
KLDLPNRPETSFLWFTNPCKTHKFIVWRRFKWVIIGLLFLLILL 
LFVAVLLYSLPNYLSMKIVKPNV 


6021 


4953 


549 


EAIQFEVSIGNYGNKFDTTCKPLASTTQYSRAVFDGNYYYYLPW 
AHTKPWTLTSYWEDISHRLDAVNTLLAMAERLQTNIEALKSGI 
QGKIPANQLAELWLKLIDEVIEDTRYTLPLTEGKANVTVLDTQI 
RKLRSP.SL3QIHEAAVRMRSEATDVKSTLAEIEDWLDKLMQLTE 
EPQNSMPDI IIWMIRGEKRLAYARIPAHQVLYSTSGENASGKYC 
GKTQT I FLKYPQEKNNGPKVPVELRVNIWLGLSAVEKKFNSFAE 
GTFTVFAIEKYbTvOALMFGKWGTSGLVGRHKFSDVTGKXKLKREF 
FIiPPKGWEWEGEWIVDPERSLLTEADAGHTEFTDEVYQWESRYP 
GGDWKPAEDTYTDANGBKAASPSELTCPPGWE'/JEDDAWSYDINR 
&i TU E - GWE YGITI PPDHKPKSWVAAEKM YHTHRRRRLVRKRKKD 
LTQTASSTAGAMEELQDQEGWEYASLIGWKFHWKQRSSDTFRRR 
J! WRRKKAP S ETHGAAAI FKLEGALGADTTEDGDEKSLEKQKHSA 
F ANTPIVSCNFDRDYIYHLRCYVYQARNLLALDKDSFSDP 
YJ'sPiiCFLHRSKTTEI IHSTLNPTWDQTI1FDEVEIYGEPQTVLQ 
NPPKVIMELFDNDQVGKDEFLGRSIFSPWKLNSEMDITPKLLW 
HPVMNGDKACGDVLVTAETiILRGKDGSNLPILPPQRAPNLYMVP 
QGIRPWQLTAIEILAWGLRNMKNFQMASITSPSLWECGGERV 
ES'^-IKNLKKTPNFPSSVLFMKVFLPBCEELYMPPLVIKVIDHRQ 
FGRKPWGQCTIERLDRFRCDPYAGKEDIVPQLKASLLSAPPCR 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 


Predicted end 

corresponding 

to first 
amino acid 
residue of 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=3top 
Codon, /-poosible nucleotide deletion, 
\=possible nucleotide insertion) 








DIVIEMEDTKP1LASKCLSSMSTALSKMASPATVHLTEKEEEIV 
DMWSKFYASSGEHEKCGQYIQKGYSKLKIYNCELENVAE7EGLT 
DFSDTFKI.YRGKSDEMEDPSWGEFKGSFRIYPLPDDPSVPAPP 
RQFRELPDSVPQECTVRIYIVRGLELQPQDHNGLCDPYIKITLG 
KKVIE\DRDHYIPNTLNPVFGRMYELSCYLPQEKDLKISVYDYD 
TFTRDEKVGETIIDLENPF\LSRFG\SHCG\IPEEYCVSGVNTW 
RDSLR\PTQ\LLQNVARFKGFPQPILSEDGSRIRYGGRDYSLDE 
FEANKILHQHLGAPEERLALHILRTQGLVPEHVETRTLHSTFQP 
NIS\RYYLRVIIWNTKDVILDEKSITGEEMSDIYVKGWIPGNEE 
NKQKTDVHYRSLDGEGNFNWRFVFPFDYLPAEQLCIVAKKEHFW 
SIDQTEFRIPPR\LIIQIW\DNDKFS\LBDYLGFPRTLTCRHTI 
HFLQKSPC-GNC/RGLDMIPDLKAMNPLKAKTASLFECKSMKGWW 
PCYAEKDGARVMAGKVEMTLEILNEKEADERPAGKGRDEPWMNP 
KLDLPNRPETSFLWFTNPCKTMKFIVWRRFKWVIIGLLFLLIBL 
LFVAVLLYSLPNYLSMKIVKPNV 


6022 


4953 


549 


eaiqfevsigiiygnkfdttckplasttqysravfdgnyyyylpw 
ahtkpwtltsywedishrldavntllamaerlqtniealkssi 
qgkipanqlaelwlklideviedtrytlpltegkanvtvldtqi 
rklrsrslsqiheaavrmrseatdvkstlaeiedwldklmqlte 
epqnskpbi iizwmirgekriiayaripahqvlystsgenasgkyc 
gktqtiflkypqeknngpkvpvelrwiwlglsavekkfnsfae 
gtftvfaemyenqalkfgkwgtsglvgrhkfsdvtgkiklkref 
flppkgwewegewivdpersllteadaghteftdevyqnesryp 
ggdwkpaedtytdangdkaaspseltcppgweweddawsydinr 
avdekgweygitippeiikpkswvaaekmyhthrrrrlvrkrkkd 
ltqtasstagameelqdqegweyasligwkfhwkqrssdtfrrr 
rwrrkroapsethgaaaifklegalgadttedgdekslekqkhsa 
ttvfgantpivscnfdrdyiyhlrcyvyqarnllaldkdsfsdp 
yah i cflhrsktte i ihstlnptwdqt 1 1 fdeve i ygepqtvlq 
nppkvi melfdndqvgkde flgrs i fs p wklnsemd itpkllw 
h p vmngdkacgdvl vtae l i lrgkdgs nlp i lp pqrapnl ymvp 
qg irpwqltai e i lawglrnmknfqmas i tspslwecggerv 
eswiknlkktpwfpssvlfmxvflpkeelympplvikvidhrq 
fgrkpwgqctierldrfrcdpyagkedivpqlkasllsappcr 
diviemedtkpllaskclssmstalskmaspatvhlt3keeeiv 
dwwskfyassgehekcgqyiqkgysklki yncelenvaefeglt 
dfsdtfklyrgksdenedpswgefkgsfriyplpddpsvpapp 
rqfrelpd3vfqectvriyivrglelqpqdnnglcdpyikitlg 

KKVIE\DRDHYIPNTLNPVFGRMYELSCYjPQEKDLKISVYDYD 
TFTRDEKVGETIIDLENPF\LSRFG\SHCG\IPEEYCVSGVNTW 
RDSLR\PTQ\LLQNVARFKGFPQPILSEDGSRIRYGGRDYSLDE 
FEANKILHQHLGAPEERLALHILRTQC-LVPEHVETRTLHSTFQP 
NIS\RYYLRVIIWNTKDVILDEKSITGEEMSDIYVKGWIPGNEE 
MKQKTDVHYRSLDGEGNFNWRFVFPFDYLPAEQLCIVAKKEHFW 
SID3TEFRIPPR\LIIQIW\DNDKFS\LDDYLGFPRTLTCRHTI 
F , ^-GGNC/RGLDMIPDLKAMNPLKAKTASLFEQKSMKGWW 
PCYAEKDGARVMAGKVEMTLEILNEKEADERPAGKGRDEPNMNP 
KLDLPNRPETSFLWFTNPCKTMKFIVWRRFKWVIIGLLFLLILL 
LFVAVLLYSLPNYLSMKIVKPNV 


6023 


102 


916 


SQELGMFVELNNLLNTTPDEAEQGKLTLLCDAKTDGSFLVHHFL 
S FYLKANCKVCFVALIQSFSHYS IVGQKLGVSLTMARERGQLVF 
LEGL/IVCSGR\VFQAQKEPHPLQFLREANAGNLKPLFEFVREA 
LI I 'CSGEARWTYPVLLVDDLSVLLSLGMGAVAVLDFIHYCRAT 
V CW Z LKGNMWLVEDSGDAEDEENDILLNGLSHQSHLI LRAEGL 
ATGFCRDVHGQLRILWRRPSQPAVHRDQSFTYQYKIQDKSVSFF 
AKGMSPAVL 


6024 




3260 


FLSFLCYPRFRCLFCLQFAIPASRMEQLNELELLMEKSFWEEAE 
LFAELFQKICWAGFPRTVLSTGMDNRYLVLAVNWQNKEGNCEK 
RLVITASQSLENKELCZLRNDWCSVPVEPGDIIHLEGDCTSDTW 
IIDKDFGYLILYPEMLISGTSIASSIRCMRRAVLSETFRSSDPA 
TRQML 1 3T VLHEVFQKAINNSFAPEKLQELAFQTIQE IRHLKEM 
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SEQ 
ID 
NO: 


Predicted 
beginning 

corresponding 
to first 

sequence 


Predicted end 

to first 
amino acid 
residue of 
amino acid 
sequence 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histidine, I = Isoleucine , K=Lysir.e, 
L=Leucine, M=Methionine, N=Asparagine, 
P=?roline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








YRLNLSQDE I KQEVEDYLPS FCKWAGDFMHKNTSTDFPQMQLSL 
PSDNSKDNSTCNIEWKPMDIEESIWSPRFGLKGKIDVTVGVKI 
HRGYKTKYKI MPLELKTGKE SNSI EHRS QWLYTLLSQERRADP 
EAGLLLYLKTGQK YPVPANHLDKRELLKLRNQMAFS LFHRIS KS 
ATRQKTQLASLPQIIEEEKTCKYCSQIGNCALYSRAVEQQMDCS 
SVPIVMLPKIEEETQHLKQTHLEYFSLWCLMLTLESQSKDNKKN 
HQNIWLMPASEMEKSGSCIGNLIRMEHVKIVCDGQYLHNFQCKH 
GAIPVrNLMAGDRVIVSGEERSLFALSRGYVKEINMTTVTCLLD 
RNLSVLPESTLFRLDQEEKNCDIDTPLGNLSKLMENTFVSKKLR 
DLIIDFREPQFISYLSSVLPHDAKDTVACILKGLNKPQRQAMKK 
VLLS KDYTL I VGMPGTGKTTT I CTLVRI LYACGFSVLLTS YTHS 
AVDNILLKLAKPKIGFLRSR\QIQKVHPAIQQFTEHEICRSKSI 
KS\LALLEELYTSQL1DATTCMGZNHPIFSRKIFDFCIVDEASQ 
ISQPICLC-PLFFSRRFVLVGDHQQLPPLVLNREARALGMSESLF 
KRLEQNKSAWQLTVQYRMNSKIMSLSNKLTYEGKLECGSDKVA 
NAVINLRHFKDVKLELEFYADYSDNPWLMGVFEPNNPVCFLNTD 
KVPAPEQVEKGGVSNVTEAKLIVFLTSIFVKAGCSPSDIGIIAP 
YRQQLKIINDLLARSIGMVEVNTVDKYQD\RDKSIVLVSFVRSN 
KDGTVGELLKDWRRT iNVA I TRAKHKL ILLGCVPS LNC Y PPLEKL 
LNHLNSEKLIIDLPSREHESLCHILGDFQRE 


6025 






" "p- -dht,p 'fplrsdllitmstlyvsphpdafpslralia 
arygeagegpgwggaiipriclqppptsrtsfppprlpaleqgpg 
glwvwgatavaqllwpaglggpggsraavlvqqwvsyadtelip 
aacgatlpalglrssaqdpqavlgalgralspleewlrlhtyla 
geaptladlaavtalllpfryvldpparriwnnvtrwfvtgvrq 
pefravlgewlysgarplshqpgpeapalpktaaqlkkeakkr 
eklekfqqkqkiqqqqpppgekkpkpekrekrdpgvitydlptp 
pgekkdvsgpmpdsyspryveaawypwweqqgffkpeygrpnvs 
aanprgvfmmc i p ppnvtgs lhlghaltnaiqdsltrwhrmrge 
tt1wnpgcdhagiatqvwekklkreqglsrhqlgreaflqevw 
kwkeekgdriyhqlkklgssldwdracftmdpklsaavteafvr 
lheegiiyrstrlvnwsctlnsaisdievdkkeltgrtllsvpg 
ykekvefgvlvsfaykvqgsdsdeevwattrietmlgdvavav 
hpkdtryqhlkgknvihpflsrslpivfdefvdmdfgtgavkit 
pahdqndyevgqrhgleaisimdsrgali.wpppflglprfear 
kavlvalkerglfrgiednpmwplcnrskdwepllrpqwyvr 
cgemaqaasaavtrgdlrilperhqrtwhawmdnire\wcmfpg 
w \hr\ ipayfvtvsdpavppgedpdgrywvsgrneaeare 
kaakefgvspdkislqqdedvldtwfssglfplsilgwpnosed 
lsvfypgtlletghdilffwvarmvmlglkltgrlpfrevylha 
ivrdakgrkmskslgnvidpldviygislqglhnqllnsnldps 
evekakegqkadfpagipecgtdalrfglcaymsqgrdinldvn 
ril3yrhfcnklwnatkfalrglgkgfvpsptsqpggheslvdr 
wirsrlteavrlsnqgfqaydfpavttaqysfwlyelcdvylec 
lkpvlkgvdqvaaecarqtlytcldvglrllspfmpfvteelfq 
rlprrkpqappslcvtpypepsecswkdpeaeaalelalsitra 
vrp\lradynlhpesgptcflevad\eatgalasavsgyvqgpg 
qaqvwavaepwglpa?\qgcavalasdecsi\hlqlqg\lldp 
arelg\klq\akrveaq\rqaq\rlr\erra\asgnpvkvpl\e 
vqeadeaklqqteaelrkvdeaialfqkml 


6026 


2674 


514 


gpitflkkkakmkdmplrihvllglaittlvqavdkkvdcprlc 
tceirpwftprsiymeastvdcndlglltfparlpantqilllq 
twwiakieystdfpvnltgldlsqnwlssvtningkpcmpqllsv 
yleenkltelpekclselsnlqelyinhnllstispgafiglen 
llrlhlnsnrlqminskwfdalpnleilmigenpiirikdmnfk 
pl inlrslviaginlte i pdnalvglenles is fydnrl i kvph 
valqkwnlkfldlnk1ipihrirrgdfsnmlhlkelginnmpel 
isidslavdnlpdlrkieatnnprlsyihpnaffrlpkleslkl 
ksnal3al yhgtieslpnlke i s ihsnp ircdcvirwmnmnktn 
irfmepdslfcvdppefqgqnvrqvhfrdmmeiclpliapesfp 
snlnveag3yvsfhcrata\epqpeiywitpsgqkllpnt\ltd 



431 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Eroline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unkr.own, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KFYVHSEGTLD1NGVTPKEGGLYTCIATNLVGADLKSVMIKVDG 
SFPQDNNGSLNIKIRDIQANSVLVSWKASSKILKSSVKWTAFVK 
TENSKAAQSARIPSDVKVYNLTHLNPSTEYKICIDIPTIYQKNR 
KKCVKVTTKGLHPDQKEYEKNNTTTLMACLGGLLG I IGVI CL I S 
CLSPEMNCDGGHSYVRNYLQKPTFALGELYPPLINLWEAGKEKS 
TSLKVKATVIGLPTNMS 


6027 


5254 


4148 


GGRRAPGRPGRS IKDEEEETVFREWS FSPDPLPVRYYDKDTTK 
PISFYLSSLEELLAWKPRLEDGFNVALEPLACRQPPLSSQRPRT 
LLCHDMMGGYLDDRFIQGSVVQTPYAFYHWQCIDVFVYFSHHTV 
TIPPVGWTNTAHRHGVCVLGTFITEWNEGGRLCEAFLAGDERSY 
QAVADRLVQ1T\RFFRFDGWLINIENSLSLAAVGNMPPFLRYLT 
TQLHRQVPGGLVLWYDSWQSGQLKWQDELNQHNRVFFDSCDGF 
FTMYNWREEHLERMLGQAGERRADVYVGVDVFARGWWGGRFDT 
DKVGGGFRPRASGPVPPLGPHFLMDLPFPSAPQRNDSSCSSQSG 
L^RCPAPAKLCPH 


S028 




3432 


NCLLLQAKGFHGEIEDLQQWLTDTERHLLASKPLGGLPETAKEQ 
LWHMEVCAAFEAKEETYKSLMQXGQQMLARCPKSAETNIDQDI 
NULSE E EJh -i\Kl\KLEEALNLA\MEFHNSL\QDFIN 
WLTUAEQTLOTASRPSLILDTVLFQIDEHKVFANEVNSHREQ.il 
ELDKTGTHLKYFSQKQDWL I KNLI ,1 SVQSRWEKWQRLVERGR 
SLDDARKRAKQFHEAWSKLMEWLEESEKSLDSELEIANDPDKIK 
TQLAQHKEFQKSLGAKHSVYDTTHRTGRSLKEKTSLADDNLKLD 
DMLS3LRDKWDTICGKSVERQNKLEEA\LLFSGQFTDALQALID 
WLYRVSPQLAEDQPVHGDIDLVMBLIDNHKAFQKELGKRTSSVQ 
ALKRSARELIEGSRDDSSMVKVQMQ3LSTP.W1 i ALSISKQTR 
LEAALRQAEEFHSWHALLEWLAEAEQTLRFHGVLPDDEDALRT 
LI DQHKEFMKKLEEKRAELNKATTMGDTVLAI CHPDS I TTI KHW 
I T I IRARFEEVLAWAKQHQQRLASALAGL IAKQELLEALLAWLQ 
WAETTLTDKDKEVIPQEIEEVKALIAEHQTFMEEMTRKQPDVDK 
VTKTYKRRAADPSSLQSHIPVLDKGRA3RKRFPASSLYPSGSQT 
QIETKNPRVNLLVSKWQQVWLLALERRRKLNDALDRLEELREFA 
NFDFDIWRKKYMRWMNHKKSRVMDFFRRIDKDQDGKITRQEFID 
G I LS S KFPTSRLEMSAVAD I FDRDGDGYID YYEFVAALHPNKDA 
YKPITDADKIEDEVTRQVAKCKCAKRFQVEQIGDNKYRFFLGNQ 
FGDSQQLRLVRILRSTVMVRVGGGWMALDEFLVKNDPCRAKGRT 
NMELREKFILADGASQGMAAFRPRGRRSRPSSRGASPNRSTSVS 
5QAAQAASPQVPATTTPKILHPLTRNYGKPWLTNSKMSTPCKAA 
ECSDFPVPSAEGTPIQGSKLRLPGYLSGKGFHSGEDSGLITTAA 
ARVRTQFADSKKTPSRPGSRAGSKAGSRASSRRGSDASDFDISE 
IQSVCSDVETVPQTHRPTPRAGSRPSTAKPSKI PTPQRKSPASK 
LDKSSKR 


£029 


- • 


3533 


IMPCGSSRLLRGCWTHPNEPVSDLSYFDCIESVMENSKVLGESM 
AGISQNAKTGDLPAFGECVGIASKALCGLTEAAAQAAYLVGIFD 
?I ISC A3HQGLVDP IQFARANQAI QMACQNLVD PGS S PSQVLSAA 
TIVAKHTSALCNACRIASSKTANPVAKRHFVQSAKEVANSTANL 
VKT I KALDGDFSEDNRNKCRI ATAPL IEAVENLTAFASNPE FVS 
IPAQISSEGSQAQEPILVSAKPMLESSSYLIRTARSLAINPKDP 
PT//SVLAGHSHTVSDSIKSLITSIRDKAPGQRECDYSIDGINRC 
IRDIEQASLAAVSQSLATRDDISVEALQEQLTSWQEIGHLIDP 
IATAARGEAAQLGHKGTQLASYFEPLILAAVGVASKILDHQQQM 
T VL DQ T KTLAESALQMLYAAKEGGGNPKAQHTHDAI TE AAQLMK 
EAVDDIMVTLNEAASEVGLVGGMVDAIAEAMSKLDEGTPPEPKG 
TFVDYQTTWKYSKAIAVTAQEMMTKSVTKPEELGGLASQMTSD 
r GHLZ FQGQMAAATAE PEE IGFQ IRTRVQDLGHGC I FLVQKAG\ 
ALQVCPTDSYTKRELIECARAVTEKVSLVLSALQAGNKGTQACI 
TAATAVS G 1 1 ADLDTTI MFATAGTLNAENSETFADHRENI LKTA 
KALVEDTKLLVSGAASTPDKLAQAAQSSAATITQLAEVVKLGAA 
SLGSDDPETQWLINAIKDVAKALSDLISATKGAASKPVDDPSM 
YCT/'--AAK'7MVTNVTSLLKTVKAV3DEATRGTRALEATIECIKQ 
ELTVFQSKDVPEKTSSPEESIRMTXGITMATAKAVAAGNSCRQE 
DVIATANLSRKAVSDMLTACKQASFHPDVSDEVRTRALRFGTEC | 
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ID 
NO: 
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amino acid 

amino acid 
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nucleotide 
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sequence 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TLGYLDLLEHVLVILQKPTPELKQQLAAFSKRVAGAVTELIQAA 
EAMKGTEWVDPEDPTVIAETELLGAAASIEAAAKKLEQLKPRAK 
PKQADETLDFSEQILEAAKSIAAATSALVKSASAAQRELVAQGK 
VGSIPANAADDGQWSQGLISAARMVAAATSSLCEAANASVQGHA 
SEEKLISSAKQVAASTAQLLVACKVKADQDSEAMRRLQAAGNAV 
KRASDNLVRAAQKAAFGKADDDDVWKTKFVGGIAQ I IAAQEEM 
LKKERELEEARKKLAQIRQQQYKFLPTELREDEG 


6030 


3 


1777 


FPGRGSPALQ1EVLICLGLMGLERALNVLAPIFYRNIVNLLTEN 
APWNSLAWTVTSYVFLKFLQGGGTGSTGFVSNLRTFLWIRVQQF 
TSRRVELLIFSHLHELSLRWHLGRRTGEVLRIADRGTSSVTGLL 
SYLVFNVIPTLADIIIGIIYFSMFFNAWFGLIVFLCMSLYLTLT 
IWTEWRTKFRRAMNTQENATRARAVDSLLNFETVKYYHAESYE 
VERYREAIIKYQGLEWKSSASLVLLNQTQNLVIGLGLLAGSLLC 
AYFVTEQKLQVGDYVLFGTYIIQLYMPLNWFGTYYRMIQTNFID 
MENMFDLLKK\ETEVKDLPGAGPFRFQKGRIEFENVHFSYADGR 
ETLQDVSFTVMPGQTLALVGPSGAGKSTILRLLFRFYDISSGCI 
RIDGQDISQVTQALFRFSHWELCPKDTVLFNDTIADNIRYGRVT 
AGNDEVEAAAQAAGIHDAIMAFPEGYRTOVGERGLKLSGGEKQR 
VAIARTILKAPGIILLDEATSALDTSNERAIQASLAKVCANRTT 
IWAHRLSTWNADQI LVI KDGCI VERGRHEALLSRGGVYADMW 
UiiUUUUit'rSKDTKPQTMER 


6031 


160 


1694 


LRMSENLEKSIWNEAGKSKSNDSEEGLEDAVECADEALQICAIKS 
DSSSPQRVQRPHSSPPRFVTVEET,LETARGVTNMALAHEIWNG 
DFQIKPVELPENSLKKRVKEIVHKAFNDCLSVQLSEDPPAYDHA 
I KLVG3 1 KETLLS FLLPGHTRLRNQI TEVLDLDL I KQEAENGAL 
DISKLAEFIIGMMGT1CAPARDEEVKKLKDIKEIVPLFREIFSV 
LDLMKVDMANFAI S S IRPHLMQQS VE YERKKFQE I LERQPNS LD 
FVTQWLEEASEDLMTQKYKHALPVGGMAAGSGDMPRLSPVAVQN 
YAYLKLLKWDHLQRPFPETVLMDQSRFHELQLQ\REQLTILGAV 
LLVTFSMAAPGISSQADFAEKLKMIVKILLTDMHLPSFHLKDVL 
TTIGEKVCLEVSS CLSLCGS S PFTTDKETVLKGQIQAVAS PDDP 
IRRIMESRILTFLETYLASGHQKPLPTVPGGLSPVQRELEEVAI 
KFARLVWYNKMVFCPYYDAILSKILVRS 


S032 


39 


2415 


AARLCRAQPTKSAWMIRDLSKMYPQTRHPAPHQPAQPFKFTISE 
SCDRI KEEFQFLQAQYHS LKLECEKLASE KTEMQRHYVMYYEMS 
YGLN I EMHKQAE IVKRLNAI CAQVIPFLS QEHQQQWQAVERAK 
QVTMAELNAIIGQQQLQAQHLSHGHGLPVPLTPHPSGLQPPAIP 
PIC-SSAGLLALSSALGGQSHLPIKDEKKHHDNDIIQRDRDSIKSS 
SVSPSASFRGAEKHRNSADYSSESKKQKTEEKEIAARYDSDGEK 
SDENLWDVSNEDPSSPRGSFAHSPRENGLDKTRLLKKDAPISP 
ASIASSSSTPSSKSKELSLNEKSTTPVSKSNTPTPRTDAPTPGS 
NSTPGLRPVPGKPPGVDPLASSLRTPMAVPCPYPTPFGIVPHAG 
KNGELTSPGAAYAGLHNISPQMSAAAAAAAAAAAYGRSPWGFD 
PHHHMRVPAIPPNLTGIPGGKPAYSFHVSADGQMQPVPFPPDAL 
IGPG I PRHARQ INTLNHGE WCAVTI SNPTRHVYTGGKGCVKVW 
DISHPGNKSPVSQLDCLNRDNYIRSCRLLPDGRTLIVGGEASTL 
S I WDLAAPTPR IKAELTSSAPACYALAI S PDSKVCFS CCSDGNI 
AVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRS 

EVLHVTKPDKYQLHLH3SCVLSIiKFAHCGKWF\VSTGKDNLLNA 
W\RTPYG\ASIF\QSKESSS\VLSCDI\SVDDKYIVTGS\SDK\ 
RATVYEVIY 




39 


2415 


AARLCRAQPTKSAWMIRDLSKMYPQTRHPAPHQPAQPFKFTISE " 
SCDRI KEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMS 
YGLNIEMHKQAEIVKRLNAICAQVIPFLSQEHQQCWQAVERAK 
QVTMAELKAIIGQQQLQAQHLSHGHGLPVPLTPHPSGLQPPAIP 
PJL;yt;AGLLALSSALGGQSHLPIKDEKKHHDNDHQRDRDSIKSS 
S VS PSAS FRGAEKERNSADYS SES KKQKTEEKE IAARYDSDGEK 
SDDNLWDVSNED P S S PRGS PAHS PRENGLDKTRLLKKDAP ISP 
ASIASSSSTPSSKSKELSLNEKSTTPVSKSNTPTPRTDAPTPGS 
NSTPGLRPVPGKPPGVDPLASSLRTPMAVPCPYPTPFGIVPHAG 



433 



WO 01/53312 



PCT/US00/34263 



SEQ 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 

amino acid 
sequence 


Predicted end 

to first 
amine acid 
residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E = 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L»Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








MNGiiLTSPGAAYAGLHNISPQMSAAAAAAAAAAAYGRSPVVGFD 
PHffllMRVPAIPPNLTGIPGGKPAYSFHVSADGQMQPVPFPPDAL 
IGPGIPRHARQINTLNHGEWCAVTISNPTRHVYTGGKGCVKVW 
DISHPGNKSPVSQLDCLNRDKYIRSCRLLPDGRTLIVGGEASTL 
SIWDLAAPTPRIKJUSLTSSAPACYALAISPDSKVCFSCCSDGNI 
AVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRS 
W\DLREGRQLQQHD/FFTSPVFSLGYCP\TEEWLAVGMENSN\V 
EVLHVTKPDKYQLHLHESCVLSLKFAHCGKWFWSTGKDNLLNA 
W\RTFYG\ASIF\QSKESSS\VLSCDI\SVDDKYIVTGS\GDK\ 
RATVYEVIY 


6034 


2S83 


714 


ESGRRRRLKRRRSPCPGTAGGPGETNPGPGACPRGPREEAAAAM 
EIAPQEAPPVPGADGDIEEAPAEAGSPSPASPPADGRLKAAAKR 
VTFPSDEDIVSGAVEPKDPWRHAQNVTVDEVIGAYKQACQKLNC 
RQIPKLLRQLQEFTDLGHRLDCLDLKGEKLDYKTCEALEEVFKR 
LQFKWDLEQTNLDEDGASALFDMIEYYESATHLNISFNKHIGT 
RGWQAAAHMMRKTSCLQYL\DARNTPLLDHSAPFVARALRIRSS 
LAVLHLENASLSGRPLMLLATALKMNMNLRELYL\ADNKLNGLQ 
DSAQLGNLLKFNCSLQILDLRNNHVLDSGLAYICEGLKEQRKGL 
VTL\VLWNNQLTHTGMAFLGKTLPHTQSLETLNLGHNPIGNEGV 
RHLKNGLISNRSVLRLGLASTKLTCESAVAVAEFIAESPRLLRL 
DLRENEI KTGGLMALS LAL KVNHS LLRLDLDRE PKKEAVKS FIE 
TQKALLAEIQNGCKRNLVLAREREEKEQPPQLSASMPETTATEP 
QPDDEPAAGVQNGAPSPAPSPDSDSDSDSDGEEEEEEEGEREET 
PSGAIDTRDTGSSEPQPPPEPPRSGPPLPNGLKPEFALALPPEP 
PPGPEVKGGSCGLEHELSCSKNEKELEELLLEASQESGQETL 


S035 


19 


404 


i uGIILHKNTGALPADPVQLISQTPTPSTKQaii'LSFLGMVG 
YFYLWIPGFAILTKPLCKLTKENLADAIDPKSFSHSSFRSLKTA 
LENAS TLALPDS SQ P F \ S LHTAE VQG C WE ILTQGLGPLPV 


6036 


1745 


356 


LPDVEKLGRRRGRKXJ_ - E 3A/V1 /SNl-P.G IP -,r < -b PPKLQRN 
SRGGQGRGVEKPPHLAALILARGGSKGIPLKNIKHLAGVPLIGW 
VLRAALDSGAFQSVWVSTDHDEIENVAKQFGAQVHRRSSEVSKD 
SSTSLDAIIEFLNYHNEVDIVGNIQATSPCLHPTDLQKVAEMIR 
EEGYDSVFSWRRHQFRWSEIQKGVREVTEPLNLNPAKRPRRQD 
WDGELYENGSFYFAKRHLIEM3YLQGGKMAYYEMRAEHSVDIDV 
DIDWPIAEQRVLRYGYFGKEKLKEIKLLVCNIDGCLTNGHIYVS 
GDQKEIISYDVKDAIGISLLKKSGIEVRLISERACSKQTLSSLK 
LDCKMEVSVSDKLAWDEWRKEMGLCWKEVAYLGNEVSDEECLK 
RVGLSGAPADACSTAQKAVGYICKCNGGRGA\IREFAEHIC\LL 
MEKGLINFMPKNRNLAVNIGEKK 


6037 


2936 


m 


WTS I, I13SVLTILLFSLQGNKMLKYSAPSAGGYLLPRKPVGTPA 
3GGFPRRHSVTLPSSKFRQNQLLSSLKGEPAPALSSRDSRFRDR 
SFSEGGERLLPTQKQPGGGQVNSSRYKT\ELCRPFEENGACKYG 
DKCQFAHGIHELRSLTRHPKYXTELCRTFHTIGFCPYGPRCHFI 
HNAEERRALAGARDLSADRPRLQHSFSFAGFPSAAATAAATGLL 
DSPTSITPPPILSADDLLGSPTLPDGTNNPF\AFSSQELASLFA 
PSKGLPGGGSPTTFLFRPMSESPHMFDSPPSPQDSLSDQEGYLS 
SSSSSHSGSDSPTLDNSRRLPIFSRLSISDD 


603S 


1450 




t < T] HA.KKLKALEAMKNKQKSVTAKDSAKTTFTSITTNTINTS 
SDXTDGTAGTPAISTTTTVEIRKSSVMTTSITSKVEKSPTTATG 
NSSCPSTETEEEKAKRLL\YCSLCKVAVNSASQLEAffiTSGTKHK 
TMLEARNG3GTIKAFPRAGVKGKGPVNKGNTGLQNKTFHCEICD 
VHVKSETQLKQHISSRRHKDRAAGKPPKPKYSPYNKLQKTAHPL 
GVKLVFSKEPSKPLAPRILPNPLAAAAAAAAVAVSSPFSLRTAP 
AATLFQTSALPPALLRPAPGP-RTAHTPVLFAPY 


6039 


4073 


1000 


LDEYEARLTLANLDDFEEDHEDDDENEWQEEKAAKITELINKL 
NFLDEAEKDLATVNSNPFDDPDAAELKPFGDPDSEEPITETASP 
RKTEDSFYNNSYNPFKEVQTPQYLNPFDEPEAFVTIKDSPPQST 
KKKKIRPVDMSKYLYADSSKTEEEELDESWPFYEPKSTPPPNKL 
WPVQELETERRVKRKAPAPPVLSPKTGVLNEMTVSAGKDLSTS 
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ID 
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amino acid 


Amine acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PKPSPIPSPVLGRKPNASQSLLVWCKEVTKNYRGVKITNFTTSW 
RNGLSFCAILHHFRPDLIDYKSLNPQDIKENNKKAYDGFASIGI 
SRLLEPSDMVLLAIPDKLTVMTYLYQIRAHFSGQELHWQIEEN 
SSKSTYKVGNYETDTNSSVDQEKFYAELSDLKREPELQQPISGA 
VDFLSQDDSVFVNDSGVGESESEHQTPDDHLSPSTASPYCRRTK 
SDTEPQKSQQSSGRTSGSDDPGICSNTDSTQAQVLLGKKRLLKA 
ETLELSDLYVSDKKKDMSPPFICEETDEQKLQTLDIGSNLEKEK 
LENSRSLECRSDPESPIKKTSLSPT3KLGYSYSRDLDLAKKKHA 
SLRQTESDPDADRTTLNHADHSSKIVQHRLLSRQEELKERARVL 
LEQARRDAALKAGNKHNTNTAAPFCNRQLSDQQDEERRRQLRER 
ARQLIAEARSGGKMSELPSYGERAAEKLKERSKASGDENDNIEI 
DTNEEIPEGFWGGGDELTNLENDLDTPEQNSKLVDLKLKKLLE 
VQPQVANSPSSAAQKAVTESSEQDMKS3TEDLRTERLQKTTERF 
RNPWFSKDSTVRKTQLQSFSQYIENRPEMKRQRSIQEDTKKGN 
EEKAAITETQRKPSEDEVLNKGFKDS\SQYWGELAALENEQKQ 
IDTRAALVEKRLRYLMDTGRNTEEEEAMMQEWFMLWIKKNALIR 
RMNQLSLLEKEHDLERRYELLNRELRAMLAIEDWQKTBAQKRRE 
QLLLDELVALVMKRDALVRDLDAQEKQAEEEDEHLERTLEQ"'v'!0-: 
KMAKXEKKCVLQ 


6040 


475 


1052 


PTALMTAPSCAFPVQFRQPSVSGLSQITKSLYISNGVAANNKLM 
LSSNQITMVINVSVEWNTLYEDIQYMQVPVADSPNSRLCDFFD 
PIADHIHSVEMKQGR\TLLHCAAGVSRSAALCLAYLMKYHAMSL 
LDAHTWTKSCRPIIRPNSGFWEQLIHYEFOLFGICNTVHMVSSPV 
GMIPDIYEKEVRLMIPL 


6041 




3 886 


TEK2EKTAHNLEN i 1 1- ' .-IE =>T ,S - ICVAKISEPEADVESVLGVS 
NLLCVLQKPKGSLKSSKKKNGKVRFADEILESNKENEKCVSSEG 
EKIECWELTTEPSLTHNSSGLLSPLRKKPLEDLVCKLADISINY 
VNERKSEQHLRFLSTLLDSFSSSRVFKMLLGDEKQSIVQAKPLE 
IAKLVQKNPAVQFLYQKLIGWLNEDQRKDFGFLVDILYSALRCC 
DNDMERKKVLDDLTKVDLKWNSLLKllEKACPSSDKHALVTPWlj 
KGDILGEKLVNLADCLCNEDLESRVSSESHFSERWTLLSLVLSQ 
HVKNDYLIGDVYVERIIVRLHETLFKTKKLSEAESSDSSVSFIC 
DVAYNYFSSAKGCLLMPSSEDLLLTLFQLCAQSKEKTHLPDFLI 
CKLKNTWLSGVNLLVHQTDSSYKESTFLHLSALWLKNQVQASSL 
DINSLQVLLSAVDDLLNTLLESEDSYLMGVYIGSVMPNDSEWEK 
NRQSLPMQWLHRPLLEGRLSLNYECFKTDFKEQDIKTLPSHLCT 
SALLSKMVLIALRKETVLENNELEKIIAELLYSLQWCEELDNPP 
IFLIGFCEILQKMNITYDNLRVLGNMSGLLQLLFNRSREHGTLW 
SLIIAKLILSRSISSDEVKPHYKRKESFFPLTEGNLHTIQSLCP 
FLS KEEKKEFSAQC I PALLGWTKKDLCS TNGGFGHLAI FNSCLQ 
TKSIDDGELLHGILKIIISWKKEHEDIFLFSCNLSEASPEVLGV 
NIEIIRFLSLFLKYCSSPLAESEWDFIMCSMLAWLETTSENQAL 
YS-P1VQLFACVSCDLACDLSAFFDSTTLDTIGNLPVWLISEWK 
EFFSQGIHSLLLPILVTVTGENKDVSETSFQNAMLKPMCETLTY 
ISKEQLLSHKLPARLVADQKTNLPEYLQTLLNTLAPLLLFRARP 
VQIAVYHMLYKLMPELPQYDQDNLKSYGDEEEEPALSPPAALMS 
LLSIQEDLLENVLGCIPVGQIVTIKPLSEDFCYVLGY1LTWKLI 
LTFFKAASSQLRALYSMYLRKTKSLMKLLYHLFRLMPENPTYAE 
T?,Vrr;./Ji\ T KDPKTFFTEELQLSIRETTMLPYHIPHLACSVYHMTL 
KDLPAMVRLWWNSSEKRVFNIVDRFTSKYVSSVLSFQEISSVQT 
STQL FNGMTVKARATTREVMAT YT 1 ED I VI EL I IQIiPSNYPLGS 
1 1 VES 3XRVGVAVQQWRNWMLQLSTYLTHONGS I MEGLALWKNN 
VDKRFEGVEDCMICFSVIHGFNYSLPKKACRTCKKKFHSA\CLY 
KWFTSSNKSTCSLCRETFF 


6042 


1306 


253 


ASPSDIKASVSNGDTTLLCSRRQSCGMKTEVRQVSLTYP 
GSFAPSHSLPLQPRSGGSLCPSRAW/PDPHQLFDDTSSAQSRGY 
GACRAPGGLSYPAASPTPHAAFLADPVSNMAMAYGSSLAAQGKE 
LVE I <N TDRF IP I TKLKYYFAVDTMYVGRXLGLLFFPYLHQDWEV 
QYQQDTPVAPRFDVNAPDLYI PAMAFI TYVLVAGLALGTQDRFS 
i -j-_QASSALAWIjTLEVLAILLSLYLVTVNTDLTTIDLVAFIi 
GYKYVGMIGGVLMGLLFGKIGYYLVLGWCCVAIFVFMIRTLRLK 
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to first 
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amino acid 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G-Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








iLADAAAEGVPVRGARNQLRMYLTMAVAAAQPMLMYWLTFHLVR 


6043 


403 


599 


LCLFFPFPCATPVLPLPSLISAL/CLSHLSVSSWFCPCQPPLPC 
PLPPLQNKTAKGSLSTEQSERG 


6044 


793 


412 


KLEKWNFTLISKVKISREVTMIASKFGIGQQVRHSLLGYLGVVV 
DIDPVYSLSEPSPDELAVNDELRAAPWYHVVMEDDNGLPVHTYL 
AEAQLSSELQDEHP\EQPSMDELAQTIRKQLQAPRLRN 


6045 


155 


2299 


SPLPQVAAMNYLRRRLSDSNFMANLPNGYMTDLQRPQPPPPPPG 
AHSPGATPGPGTATAERSSGVAPAASPAAPSPGSSGGGGFFSSL 
SrJAVKQTTAAAAATFSEQVGGGSGGAGRGGAASRVLLVIDEPHT 
DWAKYFKGKKIHGEIDIKVEQAEFSDLNLVAHANGGFSVDMEVL 
RNGVKWRSLKPDFVLIRQHAFSI4ARNGDYRSLVIGLQYAGIPS 
VNSLHSVYNFCDKPWVFAQMVRLHKKLGTEEFPLIDQTFYPNHK 
EMLSS\TTYPVWKMGHGTLWGWGKVKVDNQHDFQDIASWALT 
KTYATAEPFIDAKYDVRVQKIGQNYKAYMRTSVSGNWKTNTGSA 
MLEQIAMSDRYKLWVDTCSEIFGGLDI CAVEA1HG KDGRDH 1 1 E 
WGSSMPLIGDHQDEDKQLIVEL^/VNKMAQALPRQRQRDASPGR 
GSHGQTPSPGALPLGRQTSQQPAGPPAQQRPPPQGGPPQPGPGP 
QRQGPPLQQRPPPQGQQHLSGLGPPAGSPLPQRLPSPTSAPQQP 
ASQAAPPTQGQGRQSRPVAGGPGAPPAARPPASPSPQRQAGPPQ 
ATRQTSVSGPAPPKASGAPPGGQQRQGPPQKPPGPAGPTRQASO 
AGPVPRTGPPTTQQPRPSGPGPAGRPKPQLAOKPSODVPPPATA 
AAGGPPHPQLNKSOSLTNAFNLPEPAPPRPSLSQDEVKAF.TIRS 
LRKS FASLFSD 


6046 


212 


1075 


EGLTGPCERVPFI.LGRGPPHGATRAGHRRAVRWAGPESLPPLPR 
SLIMDSPRAGTHQGPLDAETEVGADRCTSTAYQEQRPQVEQVGK 
QAPLSPGLPAMGGPGPGPCEDPACAGGAGAGGSEPLVTVTVQCA 
FTVALRARRGADLSSLRALLGQALPHQXAQLGQLSYIAPGEDGH 
WVPIPEEESLQRAWQDAAACPRGLQLQCRGAGGRPVLYQWAQH 
SYSAQGPEDLGFRQGDTVDVLCEVDQAWLEGHCDGRIGIFPKCF 
WPAGPRMSGAPGRLPRSQQGDQP 


6047 


49 


14 0 5 


PVLVTSLRMREADTLRPPQLMEVSADIISTVEFNHTGELLATGD 
KGGRW1FQREPESKNAPHSQGEYDVY3TFQSHEPEFDYLKSLE 
IEEKINKIKWLPQQNAAHSLLSTNDKTIKLWKITERDKRPEGYN 
LKDEEGKLKDLSTVTSLQVPVLKPMDLMVEVSPRRIFANGHTYH 
INSISVNSDCETYMSADDLRINLWHLAITDRSFTP\NIVDIKPA 
XMEDLTEVITASEFHPHHCNLFVYSSSKGSLRLCDMRAAALCDK 
HSI 1 _ EE~EDPSNRSFFSEIIS\SVSDVKFSHSDRYMLTR\DYL 
TVKVWDL\NMEARPIETYQVHDYLRSKLCSLYENDCIFDKFECA 
' N I ^V1MTGA\YNNFFRMFDRKTKRDVTL\EASRESSKPRAV 
LKPRRVCVGGKRRRDDISVDSLDFTKKILHTAWHPAENIIAIAA 
TNNLYI FQDKVNSDMH 


604B 




3194 


G-RTPKFCDSPTSDLEMRNGRGRGKRMRPNSNTPVNETATASDS 
KGTSNSSKTRAGANSKGRRGSQNSSEHRPPASSTSEDVKASPSS 
ANKRKNKPLSDMELNSSSEDSKGSKRVRTNSNGSATGPLPGTKV 
3 PT VLDRNCPS PVL IDCPHPNCNKKYKHINGLKYHQAHAHTDDD 
SKPEADGDSEYGEEPILHADLGSCNG\ASVSQK\GSLSPARSAT 
PKVRLVEPHSPSPSSKFSTKGLCKKKLSGEGDTDLGALSNDGSD 
DGPSVMDETSNDAFDSLERKCMEKEKCKKPSSLKPEKIPSKSLK 
S ARPI /APLAI P PQQI YTFQTATFTAAS PGSSSGLTATVAQAMP 
NSPQLKPIQPKPTVMGEPFTVNPALTPAKDKKKKDKKKKESSKE 
LESPLTPGKVCRAEEGKSPFRESSGNGMKMEGLLNGSSDPHQSR 
i U! cj,l-J'.EADKIYSFTDNAPSPSIGGSSIlLENTTPTQPLTPLHW 
TQl\G.?iEASSVKTNSPAYSDISDAGEDGEGKVDSVKSKDAEQLVK 
EGAKKTLFPPQPQSKDSPYYQGFESYYSPSYAQSSPGALNPSSQ 
AGVESQALKTKRDEEPESIEGKVKNDICEEKKPELSSSSQQPSV 
IQQRPNMYMQSLYYNQYAYVPPYGYSDQSYHTHLLSTNTAYRQQ 
_L 2 <F QS LEQQQRGVDKKAEMGLKEREAALKEEWKQKPS IP P 
TLTKAPSLTDLVKSGPGKAKEPGADPAKSVIIPKLDDSSKLPGQ 
APEGLKVKLSDASHLSKEASEAKTGAECGRQAEMDPILWYRQEA 
EPRMWTYVYPAKYSDIKSEDERWKEERDRKLKEERSRSKDSVPK 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








E3GKESTSSDCKLPTSEESRLGSKEPRPSVHVPVSSPLTQHQSY 
IFYMHGYSYSQSYDPNHPSYRSMPAVMMQNYPGSYLPSSYSFSP 
YGSKVSGGEDADKARASPSVTCKSSSESKALDILQQHASHYKSK 
SPTISDKTSQERDRGGCGWGGGGSCSSVGGASGGERSVDRPRT 
SPSQRLMSTHHHHHHLGYSLLPAQYNLPYAAGLSSTAIVASQQG 
STP3LYPPPRR 


6049 


215 




AMTGVFDRRVPSIRSGDFQAPFQTSAAMHHPSQESPTLPESSAT 
DSDYYSPTGGAPHGYCSPTSASYG\KALNPYQYQYHGVNGSAGS 
Y PAKAYADYS YAS S YHQYGGAYNRVPSATNQPEKEVTEPEVRMV 
NGKPKKVRKPRTIYSSFQLAALQRRFQKTQYIALPERAELAASL 
GLTQTQVKIWFQNKRSKIKKIMKNGEMPPEHSPSSSDPMACNSP 
QS PAVWE PQGSSRSLSHHPHAHP PTSKQS PAS S YLENSASWYTS 
AASSINSHLPPPGSLQHPLALASGTLY 


5050 


566 


1718 


KGLERTCCAMEESDSEKTTEKENLGPRMDPPLGEPGVGSLGWVL 
PNTAMKKKVLLMGKSGSGKTSMRS 1 1 FANYIARDTRRLGATILD 
RIHSLQINSSLSTYS LVDS VGNTKTFDVEHSHVRFLGNLVLNLW 
DCGGQDTFMENYFTSQRDNIFRNVEVLIYVFDVESRELEKDMHY 
YQSCLEAILQMSPDAKIFCLVHKMDLVQEDQRDLIFKEREEDLR 
RLSRPLECS CFRTS I WDETLYKAWS3 IVYQLI PNVQQLEMNLRN 
FAEIIEADEVLLFERATFLVISHYQCKEQRDAHRFEKISNIIKQ 
FKLSCSKLAAS FQSME VRNSNFAAF I DI FTSNTYVMWMSDPS I 
PSAATLINIRNARKHFEKLERVDGPKQCLLMR 


£051 


566 


1718 


KGLERrCCAMEESDSEKTTEKENLG?RMDPPLGEPG\GSLGWVI, 
PNTAMKKKVLLMGKSGSGKTSMRSIIFANYIARDTRRLGATILD 

DCGGQDTFMENYFTSQRDNIFRNVEVLIYVFDVESRELEKDMHY 
YQSCLEAILQNSPDAKIFCLVHKMDLVQEDQRDLIFKEREEDLR 
RLSRPLECSCFRTSIWDETLYKAWSSIVYQLIPNVQQLEMNLRN 
FAE 1 1 EADEVLLFERATFLVI SHYQCKEQRDAHRFEKI SNI I KQ 
FKLSCSKLAAS FQSMEVRNSNFAAFIDIFTSNTYVMWMSDPSI 
PSAATLINIRNARKHFEKLERVDGPKQCLLMR 


6052 


= 5S 


1718 


KGLERTCCAMEESDSEKTTEKENLGPRMDPPLGEPG\GSLGWVL 
PNTAMKKKVLLMGKSGSGKTSMRSIIFANYIARDTRRLGATILD 
RIHSLQINSSLSTYSLVDSVGNTKTFDVEHSHVRFLGNLVLNLW 
DCGGQDTFMENYFTSQRDNIFRIWEVLIYVFDVESRELEKDMHY 
YQS CLEAILQNS PDAKI FCLVHKMDLVQEDQRDL I FKEREEDLR 
RLSRPLECSCFRTSIWDETLYKAWSS1VYQLIPNVQQLEMNLRN 
FAEIIEADEVLLFERATFLVISHYQCKEQRDAHRFEKISNIIKQ 
FKLSCSKLAAS FQSMEVRWSNFAAFIDIFTSNTYVMWMSDPS I 
PSAATLINIRNARKHFEKLERVDGPKQCLLMR 






1704 


KGTEMNKSRWQSRRRHGRRSHQQNPWFRLRDSEDRSDSRAAQPA ~" 

HDSGHGDDESPSTSSGTAGTSSVPELPGFYFDPEKKRYFRLLPG 

HNNCNPLTKES1RQKEMESKRLRLLQEEDRRKKIARMGFNASSM 

LRKSQLGFLNVTNYCHLAHELRLSCMERKKVQIRSMDPSALASD 

RFNLILADTNSDRLFTVNDVTVGGSKYGIIKTLQSLKTPTLKVFM 

HENLYFTNRKV\NSVCWASLNHLDSHILLCLMGLAETPGCATLL 

?ASLFVNSHPAGIDRPG\MLCSFRIPGAWSCAWSLNIQANNCFS 

TGLSRRVLLTr7WTGHRQSFGTNSDVLAQQFALMAPLLFNGCRS 

GEIFAIDLRCGNQGKGWKATRLFHDSAVTSVRILQDEQYLMASD 

M/jJK^i-'.LWDLRTTKCVRQYEGHVNEYAYLPLHVHEEEGILVAVG 

QDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRG 

APGLLMAVGQDLYCYSYS 


6054 


1 


1054 


PPIARLQE FGTSRRHMAAPSGVHLLVRRGSHRIFS SPLNHI YLH 
KQSSSQQRRNFFFRRQRDISHSIVLPAAVSSAHPVPKHIKKPDY 
VTTGIVPDMGDSIEVKNSDQIQGLHQACQLARHVLLLAGKSLKV 
DMTTEEIDALVHREIISHNAYPSPLGYGGFPKSVCTSVWNVLCH 
GIPDSRPLQDGDIIN1DVTVYYNGYHGDTSETFLVGNVDECGKK 
LVEVARRCRDEAIAACRAGAPFSVIGNTISHITHQNGFQVCPHF 
VGHG 1 GS YFHGHPE IWHHANDSDLPMEEGMAFT IEP I ITEGSPE 
FKVLEDAWTWSLD/TSKVSAQFEHTVLITSRGAQILTKLPHEA | 
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(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine , 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 




421 


2364 


PPYFLLSFIAWWLYGQSDRTETDISQSAGPPPGTLQCSALHHDP 
GCANCSRFCRDCSPPACQCHTIIVFPGNALNGVQPPELSRTLALI 
SSREPPRKKKKSQTETGKERERTSFLTQGGKRFELQHGLAGICM 
TLLITGDSIVSAEAVWDHVTKANEELAFKAGDVIKVLDASNKDW 
WWGQIDDEEGWFPASFVRLWVWHEDEVEBGPSDVQNGHLDPNSD 
CLCLGRPLQNRDQMRANVINEIMSTERHYIKHLKDICEGYLKQC 
RKRRDMFS DEQLKVI FGN I EDI YRFQMGFVRDLEKQYNNDDPHL 
SEIGPCFLEHQDGFWIYSEYCNNHLDACMELSKLMKDSRYQHFF 
EACRLLQ0MIDIA\ IDGFLLTPVQKI GKYPLQLAELLKYTAQDH 
SDYRYVAAAIAVMRNVTQQINERKRRLENIDKIAQWQASVLDWE 
GEDILDRSSELIYTGEMAWIYQP\YGRNQQRVFFLFDHQMVLCK 
KDLIRRDILYYKGRIDMDKYEWDIEDGRDDDFNV5MKHAFKLH 
NKETEEIHLFFAKKLEEKIRWLRAFREERKMVQEDEKIGFEISE 
NQKRQAAMTWKVPKQKGWSARSVPPSYPPPQDPLKHGQYLVP 
\DG1AQSQVFEFTEPKRSQSPFWQNFSRLTPFKK 


5056 


43 


3358 


SGGRGPVRVRSEQLSFSAEQVSQISQISLGRRPLSSLPPPPSRA 
IAPTRAPDTALTIMEVAEVESPLNPSCKIMTFRP3MEEFREFNK 
YLAYMESKGAHRAGLAKVIPPKEWKPRQCYDDIDNLLIPAPIQQ 
MVTGQSGLFTQYNIQKKAMTVKEFRQLANSGKYCTPRYLDYEDL 
ERKYWKNLTFVAP I YGADINGS I YDEGVDEWNIARLNTVLDWE 
EECGISIEGV1NTPYLYFGMWKTTFAWHTEDMDLYSINYLHFGEP 
KSWYAIPPEHGKRLERLAQGFFPSSSQGCDAFLRHKiyiTLISPSV 
LKKYGIPFDKITQEAGEFMITFPYGYHAGFNHGFNCAESTNFAT 
VRK IDYGKVAKLCTCRKDMVKI SMD I FVRKFQPDRYQLWKQGKD 
IYTIDHTKPTPAGTPEVKAWIjQRRRKVRKASRS FQCARSTSKRP 
KADEEEEVSDEVDGAEVPNPDSVTDDLKVSEKSEAAVKLRNTEA 
SSEEESSASRMQVEQNLSDHIKLSGNSCLSTSVTEDIKTEDDKA 
YAYRSVPSISSEADDSIPLSTGYEKPEKSDPSELSWPKSPESCS 
ESN f p- rrr p-'j [ j j p H - ~- 

IAEGENKTSKSWRHPLSRPPARSPMTLVKQQAPSDEELPEVLSI 
3EEVEETESWAKPLIHLWQTKPPNFAAEQEYNATVARMKPHCAI 
CTLLMPYHKPDSSIIEENDARWETKLDEWTSEGKTKPLIPEMCF 
IYSEENIEYSPPNAFLEEDGTSLLISCAKCCVRVHASCYGIPSH 
EICDGHLCARCKRNAWTAECCLCNLRGGALKQTKNNKWAHVMCA 
VAVPEVRFTNVPERTQIDVGRIPLQRLKLKCIFCRHRVKRVSGA 
CIQCSYGRCPASFHVTGAf-AAGVlAMEPDDWPYVVNITCFRHKV 
KPNVKSKACEKVISVGQTVITKHRNTRYY8CRVMAVTSQTFYEV 
MFDDGSFSRDTFPEDIVSRDCLKLGPPAEGEWQVKWPDGKLYG 
ifFGSI IAHMYQVEFEDGSQIAMKREDIYTIjDEELPKRVKARF 
VSAGRCHLGTCQVNSLSSFHVSQAQQETYLGFWINSKKSQCNIF 
LSGTY 


6057 


1 


853 


FVARLKEQEGEGGLGPRKEKGRARGRERRRKMQLTRCCFVFLVQ 
GSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKS 
RP^LANSTLLGLLAPPGEAWGILGQPPlsRPNHSPPPSAKVKKIFG 
WGDFYSNIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNIS 
ISLVPPSKAVEFHQEQQIFIEAKASKIFNC\RMEWEKVE\RGRR 
TSLFTHDPAKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYR 
LVQKVCPDYNYHSDTPYYPSG 


605B 


1 




HPLPSASLGIiPSVSLGVSLCVRSALLEAWPMLPKRRRARVGSP 
^PA^SSTPPSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFR 
VLDACSSEATHWMEETSAEEAVSWQERRMAAAPPGCTPPALLD 
ISWLTESLGAGQPVPVECRHRLEVAGPSKGPLSPAWMPAYACQR 
P?P1THHN?GI,SEALEILAEAAGFEGSEGRLLTFCRAASVLKAL 
PSPVTTLSQLQGLPHFGEHSSRWQELLEHGVCEEVERVRRSE/ 
RLFTQ I FGVGVKTADRWYREGLRTLDDLREQ P QKLTQQQKAGB P 
E, '.TWASLNCTLDPSASTP 


6059 


2 


3650 


' E -.bLADLTDHRAHRCPGDGDDDPQLSWVASSPSSKDVASPT 
QMIGDGCDLGLGEEEGGTGLPYPCQFCDKSFIRLSYLKRHEQIH 
, I K] -'FPCCTYCSRLFKHKRSRDRHIKLHTGDKKYHCHECEAAFS 
RSDHLKIHLKTHSSSKPFKCTVCKRGFSSTSSLQSHMQAHKKNK 
EHLAXSEKEAKKDDFMCDYCEDTFSQTEELEKHVLTRHPQLSEK 
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W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 



ilble 



uclec 



ide : 



adlqcihcpevfvdentllahihqahanqkh:<cpmcpe\qfssv " 
\egvychldshrqpdssmhsvspdpvlgsvasmssatpdssasv 
ergstpdstlkpij^gqkkmrddgqgwtkvvyscpycskrdfnsii 
avle ihlkt ihadkpqqshtcq i cldsmptl ynlnehvrklhkn 
haypvmqfgnisafhcnycpemfadinslqehirvshcgpnanp 
sdgnnaffcnqcsmgflt3ssltehiq\q\ahcsvgsaklespv 
vqptqs fmevyscp yctns p i fgs ilkltkh ikenhkni plahs 
kkskaeqspvssdvevsspkrqrlsasansisngeypcnqcdlk 
fsnfesfqthlklhlell1rkqacpqckedfdsqesllqhltvh 
ymttsthyvcescdkqfssvdd\lqkh\lldmphplccthct\l 
cqevfds \ kvs i \qvhlavkhsnekkmyrctacnwdfrkeadlq 
vhvkhshlgnpakahkc i fcgetfs tevelqchi tths kkynck 
fcskafhaiillekhlrekhcvfdaatengtangvppmatkkae 
padlqgmllkmpeapnsheaseddvdasepmygcdicgaaytme 
vllqnhrlrdhnirpgeddgsrkkaefikgshkcwcsrtffse 
nglrehlqthpj3pakhymcpicgerfpslltltehkvthsksld 
tgtcrickmplqseeefiehcqmhpdlrnsltgfrcwcmqtvt 
stlelkihgtfhmqklagssaasspngqglqklykcalclkefr 
skqdlvkldvmglpyglcagcmarsangqvggiiappepadrpca 
glrcpecsvkfesaedleshmqvdhrdltpetsgprkgtqtspv 
prkktyqcikcqmtfenereiqihvakhmieeginhecklcnc2m 
fdspakllchliehsfegmggtfkcpvcftvfvqanklqqhifa 
vhgqedxiydcsqcpckfffqtelqnhtmsqhaq 



3 Y2 IVGKWKLRVNHSQLKALCKCSLPSRLLPLGENLPLLDRGFR ~~ 

CLCFRSWLPAMLAVLLSLAPSASS 
DISASRPNILLLMADDLGIGDIGCYGIOWTMRTPNIDRIAEDGl/K 
LTQHISAASLCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGAS 
GGLPTNETTFAKILEEKGYATGLIGKWHLGLNCESASDHCHHPL 
HHGFDHFYGMPFSLMGDCARWELSEKRVNLEQKLNFLFQVIALV 
ALTLVAGKLTKLIPVSWMPVIWSALSAVLLLASSYFVGALIVHA 
DCFLMRNHTITEQPMCFQRTTPLILQEVASFLKRNKHGPFLLFV 
SFLHVHIPLITMENFLGKSLHGLYGDHVKEMDWMVGRILDTLDV 
EGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKC-MGGW 
EGGIRVPGIFRWPGVLPAGRVIGEPTSLMDVFPTWRLAGSEVP 
QDRVIDGQDLLPLLLGTAQHSDHEFLMHYCERFLHAARWHQRDR 
GTMWKVHFVTEVFQPEGAGACYGRKVCPCFGEKWHHDPFLLFD 
LSRDPSETHILTPASEPVFYQVMER\VQQAVWEHQRTLSPVPLQ 
LDRLGNIWRPWLQPCCGPFPLCWCLREDDPQ 



MNIHMKRKTIKKINTFENRMLMLDGMPAVRVKTELLESEQGSPH 
VHNYPDMEAVPLLLNNVKGEPPEDSLSVDHFQTQTEPVDLSIITK 
ARTSPTAVSSSPVSMTASASSPSSTSTSSSSSSRLASSPTVITS 
VSSASSSSTVLTPGPLVASASGVGGQQFLHIIHPVPPSSPMNLQ 
SNKLSHVHRIPVWQSVPWYTAVRSPGITO^IVVPLLEDGRG 
HGKAQMDPRGLSPRQSKSDSDDDDLPNVTLDSVNETGSTALSIA 
RAVQEVHPSPVSRVRGNRMNNQKFPCSISPFSIESTRRQRTVLN 
PPDSRKTAYSTDCDF\EGLQQKLYTKSSSPGRVHRRTHTGEKPY 
KCTWEGCTWKFARSDELTRHYRKHTGVKPFKCADCDRSFSRSDH 
LALHRRRHMLV 



ETMAKNGPEKCEDCHILNAEAFKSKKICKSLKICGLVFGILALT 
LIVLFWGSKHFWPEVPKKAYDMEHTFYSNGEKKKIYMEIDPVTR 
TEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKSTSKI 
LE I CDKVTMYW\ IKPTL\ISGTFAKQLHHNFAFI ILVSELQDFE 
EEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELP 
INDYTENGIEFDPKLDERGYCCIYCRRGNRYCRRVCEPLL.GYYP 
YPYCYQGGRVICRVIMPCNWWVARMLGRV 



ETMAKNGPENCEDCH ILWAEAFKS KKICKSLKI CGLVFGI LALT 
LIVLFWGSKHFWPEVPKKAYDMEHTFYSNGEKKKIYMEIDPVTR 
TEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
BFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKI 
LEICDNVTMYW\ INPTL\ I SGTFAKQLHHNFAF I ILVSELQDFE 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, H=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








EEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELP " 
INDYTENG IE FDPMLDERGY CCI YCRRGNRYCRRVCE PLLGYYP 
YPYCYQGGRVICRVIMPCNWWVA3MLGRV 


6064 


913 


311 


NLPQSLPRPTEHSPPYSLEKMTDLVAWDVALSDGVHKIEFEHG 
TTSGKRWYVDGKEEIRKEWMFKLVGKETFYVGAAKTKATINID 
AISGFAYEYTLE1NGKSLKKYMEDRSKTTNTWVLHMDGENFRIV 
LEKDANDVWCNGKKLETAGEFVDDGTETHFSIGTH\ACYIKAV\ 
SSG\KRKEGIIHTLIVDNREIPEIAS 


soss 


1153 


641 


MSVRVARVAWVRGLGASYRRGASSFPVPPPGAQGVAELLRDATG 
AEEEAPWAATERRMPGQCSVLLFPGQGSQVVGMGRGLLNYPRVR 
ELYAAARRVLGYDLLELSLHGPQETLDRTVHCQPAIFVASLAAV 
EKLHHLQPSVIENCVAAAGFSVGEFAALVFAGAMEFAEG 


6066 


68 


3470 


VKENMPATRKPMRYGHTEGHTEVCFDDSGSFIVTCGSDGDVRIW 
EDLDDDDPKFINVGEKAYSCALKSGKLVTAVSNNTIQVHTFPEG 
VPDGILTRFTTNANHVVFNGDGTKIAAGSSD\FLVKIVDVMDSS 
QQKTFRGHDAPVLSLSFDPKDIFLASASCDGSVRVWQISDQTCA 
ISWPLIjQKCNDVINAKS I CRLAWQP KS 3 KLLA I P VEKS VKL YRR 
ESWSHQFDLSDNFISQTLNIVTWSPCGQYLAAGSIWGLIIVWNV 
ETKDCMERVKHEKGYAICGLAWHPTCGRISYTDAEGNLGLLENV 
CDPSGKTSSSKVSSRVEKDYNDLFDGDDMSNAGDFLNDNAVEIP 
SFSKGIINDDEDDEDLMMASGRPRQRSHILEDDENSVDISMLKT 
GSSLLKEEEEDGQEGSIHNLPLVTSQRPFYDGPMPTPRQKPFQS 
GS TPLHLTHRFM VWNS I G 1 1 RC YNDEQDNAI DVE FHDTS I HHAT 
HLSNTLNYTIADLSHEAILLACESTDELASKLHCLHFSSWDSSK 
EKIIDLPQKEDTKATrT.GQGWRAAATSALLLRLFTIGGVQKEVF 
a F '^"r/AGHGEQLFIVYHRGTGFDGDQCLGVQLLELGKKKK 
QILHGDPLPLTRKSYLAWIGFSAEGTPCYVDSEGIVRMLNRGLG 
-MTWTPICNTREHCKGKSDHYWWGIHENPQQLRCIPCKGSRFPP 
TLPRPAVAILSFKLPYCQIATEKGQMEEQFWRSVIFHNHLDYLA 
KNGYEYEESTKWQATKEQQELLMKMLAIjSCKLEREFRCVELADL 

mtqnavnlaikyasrsrklilaqklselavekaaeltatqveee 
3eeedfrkklnagysntatewsqprfrnqveedaedsgeaddee 
kpeihkpgqnsfskstnssdvsaksgavtfsscgrvnpfkvsas 
skepamsmwsarstnildnmgksekkstalsrttnnekspiikp 
lipkpkpkqasaasyfqkrnsqtnkteevkeenlknvlsetpai 
cppqntenqrpktgfqmwleenrsnilsdnpdfsdeadiikegm 
irfrvlsteerkvwankakgetasegteakkrkrwdesdeten 
qeekakenlnlskkqkpldfstnqklsafafkqe 


60S7 


858 


321 


L?"A 7 QRLGVLLSRGKMAVTGWLESLRTAQKTALLQDGRRKVHYLF 
PDGKEKAEEYDEKTSELLVRKWRVKSALGAMGQWQLEVGDPAPL 
GAGNLGPELIKESNANPIFMRKDTKMSFQWIRNLPYPKDVYSV 
SVDQKERCI IVRTTNKKYYKKFSIPDLDRHQLPLDDALLSFA\T 
PTAP 


6068 


13 


173 0 


GSKMAZLANEEKPAIAPPVFVFQKDKGQKSPAEQKNLSDSGEEP 
RGEAEAPHHGTGHPESAGEHALEPPAPAGASASTPPPPAPEAQL 
PPFPRELAGRSAGGSSPEGGEDSDREDGNYCPPVKRERTSSLTQ 
FPPSQSEERSSGFRLKPPTLIHGQAPSAGLPSQKPKEQQRSVLR 
PAVLQAPQPKALSQTVPSSGTNGVSLPADCTGAVPAASPDTAAW 
P.SPSEAADEVCALEEKEPQKNESSNASEEEACEKKDPATQQAFV 
FGQNLRDRVKLINESVDEADMENAGHPSADTPTATNYFLQYISS 
PLEN5TNSADASSNKFVFGQNMSERVLSPPKLNEVSSDANRENA 
ArtESGSESSSQEATPEKESLAESAAAYTKATARKCLLEKVEVIT 
GEERZSMVLQMQCKLFVFDKTSQSWVERGRGLLRLNDMASTDDG 
TLQSRLSDAGPRGSLR\LILNTKLWAQMQIDKASEK\SIRITAM 
ENEDQGVKVFLISASSKDTGQVYA&LHHRILALRSRVEQEQEAK 
MPAPEPGAAPSNEEDDSDDDDVLAPSGATAAGAGDEGDGQTTGS 
T 


6069 


583' 


27 


PTP.PGQAGSSSAMAAQRLGKRVLSKLQSPSRARGPGGSPGGLQK 
RKARVTVKYDRRELQRRLDVEKWIDGRLEELYRGMEADMPDEIN 
IDELLELESEEERSRKIQGLLKSCGKPVEDFIQELIAKLQGLHR 
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Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








Q\PGLRQFSPSP\DGQPSAPFQGPGARTASPLTLLALFPGPPER 
RPALLCVLSCI 


6070 




858 


IRVTVDGEFLHYIFPLQFLDSPEW/R?TETHRGRHF\QVTLTAE 
TDCRYVSWRRKKLYLLFAQHRYISRLFSVLIGSDIADKLYALND 
RVYIGKRYHYDIRLPNFYQMSTPEIRRSPLTQHFQNSRRYW 








HEARTKGNKALARP\VRLFSLVTRLLLAPRRGLTVRSPDEPLPV 
VRIPVALQRQLEQRQSRRRNLPRPVLVRPGPLLVSARRPELNQP 
ARLTLGRWERAPLASQGWKSRRARRDHFSIERAQQEAPAVRKLS 
SKG3FADLGAWKPRVLHALQE\AAPEWQ\PTTVQSSTIPSLLR 
3RHWCAAETGSGKTLSYLLPLLQRLLG\HPSLDSLPIPAPRGL 
VLVPSRELAQQVRAVAQPLGRSLGLLVRDLEGGHGMRRIRLQLS 
RQPSADVLVATPGALWKALKSRLISLEQLSFLVLDEADTLLD3S 
FLELVDYILEKSHIAEGPADLEDPFNPKAQLVLVGATFPEGVGQ 
LLNKVASPDAVTTITSSKLHGIMPHVKQTFLRLKGADKVAELVH 
I LKHRDRAERTGP S GTVLVFCNSSS TVNWLGYI LDDHK I QHLRL 
QGQMPALMRVGIFQSFQKSSRDILLCTDIASRGLDSTGVELVVN 
YDFPPTLQDYIHRAGRVGRVGSEVPGTVISFVTHPWDVSLVQKI 
ELAARRRRSLPGLASSVKEPLPQAT 


6072 




742 


TEMMPTT h h - - f T 
VLFSRRTSKQQVYFFLFNDVLIITKKKSEESYNWDYSLRDQLL 
VESCDNEELNSSPGKNSSTMLYSRQSSASHLFTLTVLSNHANEK 
VEMLLG7iErQ'£^i^\kWITALG/IGCGKPPAD?.TGLTQVEIVRSFT 

akqpdelslqvadvvli\yqrvsegwyeger\lrdgergkfpme 
cakeitcqatidknvermgrlijGLEtnv 


S073 


620 


360 


PCRRGLARPLSRRPG/SILVHCAVGVSRSATLVLAYIjMLYHHLT 
IVEAIKKVXDHRGriPNNGKT.RQT.LALDRRLRQGLEA 


6074 


168 


1110 


JGARCMAT^t, CPDSMPCHNQQVNSASTP3PEQLRPGDLILDHA 
GGNRASRAKVILLTGYAHSSLPAELDSGACGGSSLNSEGNSGSG 
DSSSYDAPAGNSFLEDCELSRQIGAQLKLLPMNDQIRELQTIIR 
DKTASRGDFMFSADRLIRLWEEGLNQLPYKECMVTTPTGYKYE 
GVKFEKGNCGVSIMRSGEAMEQGIiRDCCRSIRIGKILIQSDEET 
QRAKVYYAKFPPDIYRRKVLLMYPILQTG\NTVIEAVKVLIEHG 
VQPSVIILLSLFS7PH3AKSIIQEFPEITILTTEVHPVAPTHFG 
QKYFGTD 


6075 


320 


1091 


P?TCQPQEVEHH\YGYVPILGNKTLPSRCHQCVIVSSSSHLLGT 
KLGPEIERAECTIRMNDAPTTGYSADVGNKTTYRVVAHSSVFRV 
LRRPQ5FWRTPETVFIFWGPPSKMQKPQGSLVRVIQRAGLVFP 
NMEAYAVSPGRMRQFDDLFRGETGKDREKSHSHLSTGWFTMVIA 
VELCDHVHVYGMVPPNYCSQRPRLaRMPYHYYEPKGPDECVTYI 
QH3HSRKGKHHRFITEKRVFSSWA2LYGITFSHPSWT 








HPSPTEAPRVQHLTMDCTWRILFLVAAATGTHAQVQLVQSGAEV 
KKPGASVKVSCKVSGYTLTELSMHWVRQAPGKGLEWMGAFDPED 
GETIYAQKFQGRVTMTEDTSTDTAYMELSSLRSEDTAVYYCATD 
HGDYAFDIWGQ3TMVTVSSAPTKAPDVFPIISGCRHPKDNSPW 
LAC1iITGYHPTSV\TVTWYMGTQSQA\QRTFPEIQRRDSYYMTS 
SQLSTPLQQWRQGEYKCWQHTASKSKKEZFRWPESPKAQASSV 
PTAQPQAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEQEERE 
TKTPECPSHTQPLGVYLLTPAVQDLWLRDKAT^TCFWGSDLKD 
T 'EVAGKVPTGGVEEGLLERHSNGSQSQHSRLTLPRSLWNA 
GTSVTCTLWHPSLPPQRLMALREPAAQAPVKLSLNLLASSDPPE 
A\ASWLLCEVSGFEPPNILLMWLEDHGEVNTSGFAPARPL?KP\ 
RSTTFWA\WSVLRVPAPPSPQPATYTCWSHEDSRTLI,NASRSL 
EVSYVTDHGPMK 


6077 


3687 




L^PDMNLQPIFWIGLISSVCCVFAQTDENRCLKANAKSCGECIQ 
AGPNCGWCTNS TFLQEGMPTSARCDDLEALKKKGCPPDD I ENPR 
C-SKDIKKNKSTVTNRSKGTAEKLKPEDITQIQPQQLVLRLRSGEP 
QTFTLKFKRAEDYPIDLYYLM\DLSYSMKDDLENVKSLGTDLMN 
EMRRITSDFRIGFGSFVEKTVMPYISTTPAKLRNPCTSEQNCTS 
PFSYKNVLSLTNKGEVFNELVGKQRISGNLDSPEGGFDAIMQVA 
VCGSLIGWRNVTRLLVFSTDAGFHFAGDGKLGGIVLPNDGQCHL j 
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Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ENNMYTMSHYYDYPSIAHLVQKLSENNIQTIFAVTEEFQPVYKE 
LKNLIPKSAVGTLSANSSNVIQLIIDAYNSLSSEVILENGKLSE 
GVT I S YQS Y\ CKNGVNGTGENGRKCSNIS IGDEVQFEIS ITSNK 
CPKKDSDSFKIRPLGFTEEVEVILQYICECECQSEGIPESPKCH 
EGNGTFECGACRCNEGRVGRHCECSTDEVNSEDIGCFTARKENQ 
FQKSASNHGRVPSAGQCVCRKRDNTNEIYSGKFCECDNFNCDRS 
NGL I CGGNGV CKCRVCE CNPNYTGSACDCSLDTSTCEASNGQ I C 
NGRGICECGVCKCTDPKFQGQTCEMCQTCLGVCAEHKECVQCRA 
FNKGEKKDTCTQECSYFNITKVESRDKLPQPVQPDPVSHCKEKD 
VDDCWFYFTYSVNGNNEVMVHWENPECPTGPDIIPIVAGWAG 
IVLIGLALLLIWKLLMIIHDRREFAKFEKEKMNAKWDTGEMPIY 
KSAVTTWNPKYEGK 


6078 


1425 


180 


ETEDVMELLEEDLTCPICCSLFDDPRVLPCSHNFCKKCLEGILE 
GSVRNSLWRPVPFKCPTCRKKTFSYWELIPLQ'/NYSLKGIVEKY 
NKIKISPICMPVCKGH\LGQPLNIF\CL\TDMQLDL/CGIC\ATR 
GEHTKHVFCSIEDAYAQERDAFESLFQSFE7WRRGDALSRLDTL 
ETSKRKSLQLLTKDSDKVKEFFEKLQHTLDQKKNEILSDFETMK 
LAVMQAYDPEINKLNTILQEQRMAFNIAEAFKDVSEPIVFLQQM 
QE FREKIKV I KETPLP PSN LPAS PLMKNFDTSQWED I KLVDVDK 
LSLPQDTGTFISKIPWSFYKLFLLILLLGLVIVFGPTMFLEWSL 
FDDLATWKGCLSNFSSYLTKTADFIEQSVFYWEQVTDGFFIFNE 
RFKNFTLWLNNVAEFVCKYKLL 


6079 


158S 


141 


ATARDL3CARRIDRWMESTPSRGLNRVHLQCRNLQEFLGGL5P 
GVLDRLYGHPATCLAVFRELPSLAKNWVKRMLFLEQPLPQAAVA 
EFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPTFRQ 
NLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEWL 
HFMVGSPSAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFL 
LLDTPAQLWYFMLQYLQTAQSRGMDLVE ILS FLFQLS FSTLGKD 
YSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPT/RALAINL 
SSGVSGAGGTVHQPGFIV\VETNYRLYAYTESELQIALIALFSE 
MLYPFP\NMW\ARVTR\ESVQQAIASGITAQQIIHFLRTRAHP 
VMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDF 
ELL\LAHAPKLGVLVFE/NTPAKRLMWTPAGHSDVKRFWKRQK 
HSS 


6080 




1199 


IETIDHVGEFAMAAQAAGVSRQRAATQGLGENQNALKYLGQDFK 
TLRQQCLDSGVLFKDPEFPACPSALGYXDLGPGSPQTQGIIWKR 
PTELCPSPQFIVGGATRTDICQGGLGDCWLLAAIASLTLNEELL 
YRWPRDQDFQENYAGIFHFQPLCPPS?\FWQYGEWVEWIDDR 
LPTKNGQLLFLHSEQGNEFWSALLEKAYAKLNGCYEALAGGSTV 
EGFEDFTGGISEFYDLKKPPANLYQIIRKALCAGSLLGCSIDVY 
3AA2AEAITSQKLVKSHAYSVTGVEEVNFQGHPEKLIRLRNPWG 
3VEKSGAWSDDAPEWNHIDPRRKEELDKKVEDGEFWMSLSDFVR 
QFSRLEICMLSPDSLSSEEVHKWNLVLFNGHWTRGSTAGGCQNY 
PGSS 


S081 


3 


865 


3MLPLLLPLPLLWA/GALAQDARFRLEMPESVTVQEGLCIFVHC 
SVFYLEYGWKDSTPAYGHWFREGVSVDQETPVATIffirSTQKVQKE 
TQ3RFHLLGDPSRKNCSLSIRDARRRDNGSYFFWVARGRTKFSY 
KYSPLSVYVTALTHRPDILIPEFLKSGHPSNLTCSVPWVCEQGT 
PPIFSWMSAAPTSLGPRTLHSSVLTIIPRPQDHGTNLICQVTFP 
GAGVTTERTIQLSVSWKSGTVEEVWLAVGWAVKILLLCLCLI 
ILSFHKKKAVRAVEVEENVYAVMG 


6082 


283 


1288 


1"-- ~r 1PTQTRTAPG1AAPGLAQPAALRLLLSRPPSAAMDGDGD 
PE3VGQPEEASPEEQPEEASAEEERPEDQQEEEAAAAA\Y\LDE 
LPEPLLA/LRVLAALPRHE\LVQACR\LVCLRWKELVDGAPLWL 
LKCQQEGLVPEGGVEEERDHWQQFYFLSKRRRNLLRNPCGEEDL 
EGWCDVEHGGDGWRVEELPGDSGVEFTHDESVKKYFASSFEWCR 
KAQVIDLQAEGYWEELLDTTQPA1WKDWYSGRSDAGCLYELTV 
KLISEHENVLAEFSSGQVAVPQDSDGGGWMEISHTFTDYGPGVR 
FVRFEHGGQDSVYWKGWFGARVTNSSVWVEP 


6083 


1865 


309 


KQWCAERRGLGMSLADELLADLEEAAEEEEGGSYGEEEEEPAIE 
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DVQEETQLDLSGDSVKTIAKLWDSKMFAEIMMKIEEYISKQAKA 
S E VMG P VEAAPE YRVI VDANNLTVE IENELNI IHKFIRDKYS KR 
FP3LESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIM 
WSVTASTTQGQQLSEEELEP.LEEACDMALELNASKHRIYEYVE 
SRMSFIAPNL3IIIGASTAAKIMGVAGGLTNLSKMPACNIMLLG 
AQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPIPPPFSVAP\DL 
RRKAARLVAAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKW 
QEPPPVKQVKPLPAPLDGQRKKRGGRRYP.KMKERLGLTEIR\KQ 
ANRMSFGEIE3DAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKA 
RISKTLQRTLQKQSWYGGRSTIRDRSSGTASSVAFTPLQGLEI 


6084 


1865 




KQWCAERRGLGMSLADELLADLEEAAEEEEGGSYGEEEEEPAIE 
DVQEETQLDLSGDSVKTIAKLWDSKMFAEIMMKIEEYISKQAKA 
SEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIRDKYSKR 
FPELESLVPNALDY1RTVKELGNSLDKCKNWENLQQILTNATIM 
WSVTASTTQGQQLSEEELERLEEACDMALELNASKHRIYEYVE 
S RMS FIAPNLSII IGAS TAAK I MGVAGGLTNLS KMPACN I MLLG 
AQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPIPPPFSVAP\DL 
RRKA^RLVAAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKW 
QEPPPVKQVKPLPAPLDGQRKKRGGRRYRKMKERLGLTEIR\KQ 
ANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTCVNEATKA 
RISKTjQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEI 


6085 




1456 


SGPRSFQGKRAVGRISLGGKRNPEVTLLPGVSSEHVRRWRRARV 
GVARVKPGNPWKPSPATQVPR/VPAQVYLPGRGPPLREGEELVM 
DEEAYVLYHRAQTGAPCLSFDIVRDHLGDNRTELPLTLYLCAGT 
2AESAQSNRLt4MLRMHNLHGTKPPPSEGSDEEEEEEDEEDEEER 
KPQLELAMVPHYGGIWRVRVSWLGEEPVAGWSEKGQVEVFALR 
RLLQVVEEPQALAAFLRDEQAQMKPIFSFAGHMGEGFALDWSPR 
VTC-RLLTGDCQKNIHLWTPTDGGEWHVDQRPFVGHTRSVEDLQW 
S P TENTVFAS CSADAS I R I WD I RAAPSKACML TTATAHDGDVNV 
ISWSRREPFLLSGGDDGALKIWDLRQFKS3SPVATFKQHVAPVT 
SVEWHPQDSGVFAASGADHQITQWDLG/IVERDPEAGDVEADPG 
LAELPQQLLFraQGETELKELHWHPQCPGLLVSTALSGFTIFRT 
ISV 


6086 


2419 


1357 


GAATQHGGAMNLLPCNPHGNGLLYAGFNQDHGCFACG.MEKGFRV 
YNTDPLKEKEKQEFLEGGVGHVEMLFRCNYLALVGGGKKPKYPP 
NKVMIWDDLKKKTVIEIEFSTEVKAVKLRR\DKIWVLDSMIKV 
?TFTHNP\HQLHVFE\TCYNPKGLCVLCPNSNNSLLAFPGTHTG 
HVQLVDLASTEKPPVDIPAHEGVLSCIALNLQGTRIATASEKGT 
LIRIFDTSSGHLIQELRRGSQAANIYCINFNQDASLICVSSDHG 
TVHI FAAEDPKRNKQS SLASAS FLPKYFS SKWSFSKFQVPSGS P 

DDKL 


6087 


476 


1877 


QNSQRTGLPITIFSRSFPLLTGSDLCBNMPCTCTWRNWRQWIRP 
LVAVIYLVSIWAVPLCVWELQKLEVGIHTKAWFIAGIFLLLTI 
PISLWVILQHLVHYTQPELQKPIIRILWMVPIYSLDSWIALKYP 
GIAIYVDTCRECYEAYVIYNFMGFLTNYLTNRYPKLVLILEAKD 
QQKHFPPLCCCPPWAMGEVLLFRCKLGVLQYTWRPFTTIVALI 
CELLGIYDEGNFSFSNAWTYLVIINNMSQLFAMYCLLLFYiCVLK 
EEL3PIQPVGKFLCVKLWFVSFWQAWIALLVKVGVISEKHTW 
EI','2TVEAVATGLQDFIICIEMFLAAIA\HHYTFSYKPYVQEAEE 
GSCFDSFLAMWDVSDIRDDISEQVRHVGRWRGHPRKKLFPEDQ 
DQNEHTSLLSSBSQDAISIASSMPPSPMGHYQGFGHTVTPQTTP 
TTAKISDEILSDTIGEKKEPSDKSVDS 


S088 


1684 


689 


- LVR1 )QG RCLLAPVAPKLVPPVRGVKKGFRAAFRFQKE 
LJRiJxLLRCPPPPVRRSEKPNWDYHAEIQAFGHRLQENFSLDLL 
KTAFVMSCYIKSEEAKRQQLGIEKEAVLLNLKSWQELSEQGTSF 
SQTCLTQFLEDEYPDMPTEGIKNLVDFLTGEEWCHVARNLAVE 
QLTLSEEFPVPPAVLQQTFFAVIGALLQSSGPERTALFIRDFLI 
TQMTGKELFEMWKIINPMGLLVEELKKRNVSAPESRLTRQSG\A | 
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Codon, /=possible nucleotide deletion, 
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PTALPLYFVGLYCDKKLIAEGPGETVLVAEEEAARVALRKLYGF 
TENRRPWNYS KPKETLRAEKS I TAS 


6089 


3 


3054 


TR_,GIPGSTISSRPRLCALAAEGHFLGHSWTGSRAGAHTGAPAW 
PSRRLRDLPAGGMWRLRRAAVACEVCQSLVKHSSGIKGSLPLQK 
LHLVSRSIYHSHHPTLKLQRPQLRTSFQQFSSLTNLPLRKLKFS 
P I KYGYQPRRNFKPARLATRLLKLRYLI LGSAVGGGYTAKKTFD 
QWKDMIPDLSEYKWIVPDIVWEIDEYIDFEKIRKALPSSEDLVK 
LAPDFDKIVESLSLLKDFFTSGSPEETAFRATDRGSESDKHFRK 
VSDKEKIDQLQEELLHTQLKYQRILERLEKENKELRKLV1QKDD 
KGIPFIESLRKSLIDMYSEVLDVLSDYDASYHTQDHLPRWVVG 
DQSAGKTSVLEMIAQARIFPRGSGEMMTRSPVKVTLSEGPHHVA 
1FKDSSREFDLTKEEDLAALRHEIELRMRKWKEGCTVSPETIS 
LNVKGPGLQRMVLVDLPGVINTVTSGMAPDTKETIFSISKAYMQ 
DPHAIILCIQDGSVDAERSIVTDLVSQMDPHGRRTIFVLTKVDL 
AEKNVASPSRIQQIIEGKLFPMKALGYFAWTGKGNSSESIEAI 
REYEEEFFQNSKLLKTSMLICAHQVTTRNLSLAVSDCFWKMVRES 
VEQQADSFKATRFNLETEWKMNYPRLRELDRIIELFEKAK1IEILD 
EVISLSQVTPIOJWEE I LQQSLWERVSTHVIEMI YLPAAQTMNSG 
TFNTTVDIKLKQWTDKQLPNKAVEVAWETLQEEF3RFMTEPKGK 
EHDDIFDKLKEAVKEESIKRHKWNDFAEDSLRVIQHNALEDRSI 
SDKQUWDAA1YFMEEALQARLKDTENAIENMVGPD\WKKRWLYW 
KNRTQEQCVHMETKNELEKMLKCNEEHPAYLASDEITTVRKNLE 
SRGVEVDPSLIKDTWHQVYRRHFT.KTALNHCNLCRRGFYYYQRH 
FVDSELECNDWLFWRIQRMLAITANTLRQQLTNTEVRRLEKNV 
KEVLEDFAEDGEKKI KLLTGKRVQLAEDLKKVRE I QEKLDAF I E 
ALHQEK 


6090 




1560 


PVFVPAPGAVLEQAS/ASPPLATQTVVPLQHCKIPELPVQASIL 
FELQLFFCQLIALFVHYINIYKTWWYPPSHPPSHTSLNFHLID 
FNLLMVTTIVLGRRFIGSIVKEASQRGKVSLFRSILLFLTRFTV 
LTATGWSLCRSLIHLFRTYSFLNLL/FPLLSVWDVHSVPAAELR 
P\RKTSLFNHMASMGPREAVSGLAKSRDYLLTLR\RR3SSTQDS 
CMARTPCP/PHACCLSPSLIRSEVEFLKMDFNWRMKEVLVSSML 
3AYYVAFVPWFVKNTHYYDKRWSCELFLLVSISTSVILMQHLL 
PASYCDLLHKAAAHLGCWQKVDPALCSNVLQHPWTEECMWPQGV 
LVKHSKNVYKAVGHYNVAIPSDVSHFRFHFFFSKPLRILNILLL 
LEGAVIVYQLYSLMSSEKWHQTISLALILFSNYYAFFKLLRDRL 
VLGKAYSYSASPQRDLDHRFS 


6091 


3279 




SSRTREMEEKEILRRQIRLLQGLIDDYKTLHGNAPAPGTPAASG 
WQPPTYHSGRAFSARYPRPSRRGYSSHHGPSWRKKYSLVNRPPG 
P3DPPADHAVRPLHGARGGQPPVPQQHVLERQVQLSQGQNWIK 
VKPPSKSG5ASASGAQRGSLEEFEDTPWSDQRPREGEGEPPRGQ 
LQPSRPTRARGTCSVEDPLLVCQKEPGKPRMVKSVGSVGDSPRE 
PRRTVSESVIAVKASFPSSALPPRTGVALGRKLGSHSVASCAPQ 
LLGDRRVDAGHTDQPVPSGSVGGPARPASGPRQAREASLWTCR 
... ■ --WYKWVAASSKSPRVARRALSPRVAAENVCKASAGMAW 
KVEKPQLIADPEPKPRKPATSSKPGSAPSKYKWKASSPSASSSS 
SFR[ .-I]AG5KDHASQLSPVLSRSPSGD\RPAVGHSGLKPLSGE 
TPLSAYKVKSRTKIIRRRGSTSLPGDKKSGTSPAATAKSHLSLR 
RRQALRGKSSPVLKKT D NKGLVQVTTHRLCRLPPSRAHLPTKEA 
S S LHAVRTAPTSKVI KTRYRI VKKTPASPLSAPPFPLSLPSWRA 
RRL3LSRSLVLNRLRPVASGGGKAQPGSPWWRSKGYRCIGGVLY 
KVSANKLSKTSGQPSDAGSRPLLRTGRLDPAGSCSRSIASRAVQ 
RSLAI IRQARQRREKRKEYCMYYNRFGRCMRGERCPYIHDPEKV 
AVCTRFVRGTCKKTDGTCPFSHHVSKEKMPVCSYFLKGICSNSN 
CPYSHVYVSRKAEVCSDFIjKGYCPLGAKCKKKHTLLCPDFARRG 
ACPRGAQCQLIjHRTQKRHSRRAATSPAPGPSDATARSRVSASHG 
PRK? SASQRPTRQTPS S AALTAAAVAAP PHCPGGSASPSSS KAS 
SSSSSSSSPPASLDHEAPSLQEAALAAACSNRLCKLPSFISLQS 
£ PS PGAQPRVRAPRAPLTKDSGKPLHIKPRL 


5092 


143 


3190 


AKAP PTGES S E ? EAKVLHTKRLYRAWEAVHRLDLILCNKTAYQ 
2VFKPENISLRNKLRELCVKLMFLHPVDYGRKAEELLWRKVYYE 
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(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
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VIQLIKTNKKHIHSRSTLECAYRTHLVAGIGFYQHLLLY1QSHY 
QLELQCCIDWTHVTDPLIGCKKPVSASGKEMDWAQMACHRCLVY 
LGDLSRYQNEIAGVDTELLAERFYYQALSVAPQIGMPFNQLGTL 
AG S KY YNVE AMY CYLRCIQSEVS FEGAYGNLKRLYDKAAKMY HQ 
LKKCETRKLSPGKKRCKDIKRLLVNFMYLQSLLQPKSSSVDSEL 
TSLCQSVLEDFNLCLFYLPSSPNLSIASEDEEEYESGYAFLPDL 
LIFQMVIICLMCVHSLERAGSKQYSAAIAFTLALFSHLVNHVNI 
RLQAELEEGENPVPAFQSDGTDEPESKEPVEKEEEPDPEPPPVT 
PQVGEGRKSRKFSRLSCLRRRRHPPKVGDDSDLSEGFESDSSHD 
SARASEGSDSGSDKSLEGGGTAFDAETDSEMNSQESRSDLEDME 
EEEGTRSPTLEPPRGRSEAPDSLNGPLGPSEASIASNLQAMSTQ 
MFQTKRCFRLAPTFSNLLLQPTTNPHTSASHRPCVNGDVDKPSE 
PASEEGSESEGSESSGRSCRNERSIQEKLQVLMAEGLLPAVKVF 
LDWLRTNPDLIIVCAQSSQSLWNRLSVLLNLLPAAGELQESGLA 
LCPEVQDLLEGCELPDLPSSLLLPEDMALRNLPPLRAAHRRFNF 
DTDRPLLSTLEE S WR1 CCIRSFGHF I ARLQGS ILQFNPEVGI F 
VS I ACS EQESLLQQAQAQFRMAQEEARRNRLMRDMAQLRLQLEV 
SQLEGSLQQPKAQSAMSPYLVPDTQALCHHLPVIRQLATSGRFI 
VIIPRTVIDGLDLLKKEHPGARDGIRYLEAEFKKGNRYIRCOKE 
VGKSFERHKLKRQDADAWTL.YKILDSCKQLT\LAQGAGEEDPSG 
M VT 1 1 TGLP LDWP S LLS G PMQAALQAAAHAS VD I KNVLDF YKQW 
KEIG 


6093 


76 


1002 


ACGRRAMLALRVART/SRWGAL\RGAVWAPGTRPSKRRACWALIi 
PPVPCCLGCLAERWRLRPAALGLRLPGIGQRNHCSGAGKAAPR\ 
PAAGAGAA^EAPGGQWGPASTPSLYENPWTIPNMLSMTRIGLAP 
VLGYLIIEEDFKIALGVFALAGLTDLLDGFIARNWANQRSALGS 
ALDPLADKILISILYVSLTYADLIPVPLTYMIISRDVMLIAAVF 
YVRYRTLPTPRTLAKYFNPCYATARLKPTFISKVNTAVQLILVA 
ASLAAPVFNYADSIYLQILWCFTAFTTAASAYSYYHYGRKTVQV 
IKD 


6094 


23 


1010 


PFLRCLRGDQKAKMSERKVLWKYYPPDFDPSKIPKLKLPKDRQY 
WRLiyiAPFNMRCKTCGEYIYKGKKFNARKETVQNEVYLGLPIFR 
FYIKCTRCLAEITFKTDPENTDYTMEHGATRNFQAEKLLEEEEK 
RVQKEREDEELNNPMKVLENRTKDSKLEMEVLENLQELKDLNQR 
QAHVDFEAMLRQHRLSEEERRRQQQEEDEQETAALLEEARKRRL 
LEDSDSEDEAAPSPLQPALRPNPTAILDEAPKPKRKVEVWEQSV 
GSLGSRPPLSRLVWKKAKADPDCSNGQPQA/APHPRSPAEQEG 
GQPYTPDAWRVLPEPTGCIPGQ 


6095 


1 


1599 


TRGRAAERSRGRGHGFLGGGFA\SWDYFPSEDFYRCGYCKNES 
GSRSNGMWAHSMTVQDYQDLIDRGWRRSGKYVYKPVMNQTCCPQ 
YTIRCRPLQFQPS KSHKKVliKKMLKFLiAKGE VP KGS CE \ DE PMD 
STMDDAVAGDFALINKLDIQCDLKTLSDDIKESLESEGKNSKKE 
EPQELLQSQDFVGEKLGSGEPSHS 



TRADOCS:1416257.1(%CSH01 !.DOC) 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 


Fredicted end 

nucleotide 

location 

amino acid 
residue of 

sequence 


- i acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 

L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








VKVHTVPKPGKGADLSKPPCEKAKEIRKERKRLKLMQQNPAGEL 
EGFQAQGHPPSLFPPKAKSNQPKSLEDLIFESLPENASHKLEVR 
WRSSPPSSQFKATLLESYQVYKRYQMVIHKNPPDTPTESQFTR 
FLCSSPLEAETPPNGPDCGYGSFHQQYWLDGKIIAVGVIDILPN 
CVSSVYLYYDPDYSFLSLGVYSALREIAFTRQLHEKTSQLSYYY 
MGFYIHSCPKMKYKGQYRPSDLLCPETYVWVPIEQCLPSLENSK 
YCRFNQDPEAVDEDRSTEPDRLQVFHKRAIMPYGVYKKQQKDPS 
EEAAVLQYASLVGQKCSERMLLFRN 


6096 


2277 


575 


QRVRAALLSSAMEDSEALGFEHMGLDPRLLQAVTDLGWSRPTLI 
QEKAIPLALEGKDLLARARTGEGKTAAYAIPMLQLLLHRKATGP 
WEQAVRGLVLVPTKELARQAQSMIQQLATYCARDVRVANVSAA 
EDSVSQRAVLMEKPDVWGTPSRILSHLQQDSLKLRDSLELLW 
DEADLLFSFGFEEELKSLLCHLPRIYQAFLMSATFNEDVQALKE 
LILHNPVTLKLQESQLPGPDQLQQFQWCETEEDKFLLLYALLK 
LSLIRGKSLLFWTLERSYRLRLFLEQFSIPTCVLNGELPLRSR 
CHTTSQFNQGFYDCVIATDAEVLGAPVKGKRRGRGPKCDKASDP 
EAGVARGIDFHHVSAVLNFDLPPTPEAYIHRAGRTARANNPGIV 
LTFVLPTEQFHLGKIEELLSGENRGPILLPYQFRMEEIEGFRYR 
CRD/iMRSVTKOAIREARLKElKEELijll-T'"! " r EDNPR\DLQ 
LLRIIDLPLHPAWKPHLGHVPDYLVPPALRGLVRPHKK\GRSCL 
PLVGRPREQSPRTHCAASSTKERKSDPQPSPPEWGPLWS 


6097 


1673 




APGTMSGGKKKSSFQiTSV L'TIJY-GI V r'JASDPPTPOPPTGPP 
PRLPNGEPSPDPGGKGTPRNGSPPPGAPSSRFRWKLPHGLGEP 
YRRGRWTCVDVYERDLEPHSFGGLLEGIRGASGGAGGRSLDSRL 
ELASLGLGAPTPPSGLSQGPTSWLRPPPTSPGPQARSFTGGLGQ 
LWPSKAKAEKPPLSASSPQQRPPEPETGESAGTSRAATPLPSL 
RVEAEAGGSGARTPPLSRRKAVDMRLRMELGAPEEMGQVPPLDS 
RPSSPALYFTHDASLVHKSPDPFGAVAAQKFSLAHSMLAISGHL 
DSDDDSGSGSLVGIDNKIEQAMDLVKSHLMFAVREEVEVLKEQI 
RELAERNAALEQENGLLRALA\SPEQLGSAGPPRGVPR\LGPPA 
PNGPFVLSLPSLTIVPLGLPGLASAAWPPLPMPALIVPVFPGVG 
VQALSNGPWSPGPLPHLLIIPSLDGGGEGFRTGRQQGAPFGEET 


G0S8 




1074 


NYCLRHRSPLEKDSSPGSSSTSLLIKKQRETSDTPIMRALKELD 
EGKIFKNWGTQTEKEDTSNINPRQTETSVNASRSPEKCAQQRQK 
RLNSASQRS SSLP PSNRKS STPTKRE IMLTPVTVAYS PKRS PKE 
KLSPGFSHLLSKNESSPIRFDILLDDLDTVPVSTLQRTNPRKQL 
\QFLPLDDSEEK\TYSEKAT\DNIVNHSSCPEPVPNGVKKVSVR 

E L T KE XDQ I EAALS RM PS PGGR I TLQTRLITQE AFGRS FGKD 


6099 


168 


1074 


IIYCLRHRSPLEKDSSPGSSSTSLLIKKQRETSDTPIMRALKELD 
EGKIFKNWGTQTEKEDTSNINPRQTETSV1IASRSEEKCAQQRQK 
RLNSASQRSSSLPPSNRKSSTPTKREIMLTPVTVAYSPKRSPKE 
NLSPGFSHLLSKNESSPIRFDILLDDLDTVPVSTLQRTNPRKQL 
\QFLPLDDSEEK\TYSEKAT\DWIVNHSSCPEPVPNGVKKVSVR 
rA'..'JI-nCKSVSYEQCKPVSVTPQGNDFEYTAKIRTLAETERFF\D 
ELTKEKDQIEAALSRMPSPGGRITLQTRLNQEAFGRSFGKD 


6100 


2 


713 


i E RSRADPEPRGRDTMTYAYLFKYI I IGDTGVGKSCLLL 
QF7DKRFQPVHDLTIGVEFGARMVNIDC-KQIKLQIWDTAGQESF 
RSI TRSYYRGAAGALLVYDITRRETFNHLTSWLEDARQHS SSNM 
VIML IGNKSDLESRRDVKREEGEAFARE \HGLI FMETSAKTACN 
VEEAFINTAKEIYRKIQQGLFDVHNEANGIKIGPQQS ISTSVGP 
SASQRNSRDIGSNSGCC 


5101 


1 


1399 


FRGRAWPLREVSHWLGCRRVCSWSASWGRJiPALSARLSPLLAFR 
GKMVFPLSCAVQQYAWGKMGSNSEVARLLASSDPLAQIAEDKPY 
AEL'AMGTKPRGDAKILDWRISQKTLSQWIAENQDSLGSKVKDTF 
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Predicted end 



corresponding 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V^Valine, 
W=Tryptophan, Y= Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



NGNLPFLFKVLSVETPLSIQAHPNKELAEKLHLQAPQHYPDANH 
KPEMAIALTPFQGLCGFRPVEEIVTFLKKVPEFQFLIGDEAATH 
LKQTMSHDSQAVASSLQSCFSHLMKSEKKVWEQLNLLVKRISQ 
QAAAGNNMEDI FGELLLQLHQQ YPGD IGCFAI YFLNLLTLKPGE 
AMFLEANVPHAYLKGDCVECMACSDNTVEAGLTPKFIDVPTLCE 
MLSYTPSSSKDRLFLPTRSQEDPYLSIYDPPVPDFTIMKA\EVP 
G\SVTEYKDLALDSASILLMVQGTVIASTPTTQTPIPLQHGGVL 
FIGANESVSLKLTEPKDLLIFRACCLL 



QTPQATLAANGAEDSRGGEMLPAGEIGASPAAPCCSESGDERKN 
LEEKSDINVTVLIGSKQVSEGTDNGDLPSYVSAFIEKEVGNDLK 
SLKKLDKLIEQRTVSKMQLEEQVLTISSEIPKRIRSALKNAEES 
KQFLNQFLEQETHLFSAINSHLLTAQPWMDDLGTMISQIEEIER 
HLAYLKWISQIEELSDNIQQYLMTNMVPEAASTLVSMAELDIKL 
QESSCTHLLGFMRATVKFWHKILKDKLTSDFEEILAQLHWPFIA 
PPQSQTVGLSRPASAPF.IYSYLETLFCQLLKLQTSHELLTEEK\ 
HSQKNTLFLPPLLSS/WPIQVMLTPLQKRFRYHFRGMRQTKTVLS 
KPEWYLAQVLMWIGNHTEFLDEKIQPILDKVGSLVNARLEFSRG 
LMMLVLEKLATDIPCLLYDDNLFCHLVDEVLLFERELII3VIIGYP 
GTFASCMHILSEETCFQRWLTVERKFALQKMDSMLSSEAAWVSQ 
YKDITDVDEMKVPDCAETFMTLLLVITDRYKNLPTASRKLQFLE 
LQKDLVDDFRIRLTQVMKEETRASLGFRYCAILNAVNYISTVLA 
DWADNVFFLQLQQAALEVFAENNTLSKLQLGQLASMESSVFDDN 
INLLERLKHDMIjTRQVDHVFREVKDAAKLYKKERWLSLPSQSEQ 
AVMSLSSSACPLLLTLRDHLLQLEQQLCFSLFKIFWQMLVEKLD 
VYIYQEIILANHFNEGGAAQLQFDMTRNLFPLFSHYCKRPENYF 
KHIKEACIVLNLNVGSALTAGKDVLPVQLQGSFPAT 



ESNSTMTTYLEFIQQNEERDGVRFSWNVWPSSRLEATRMWPVA 
ALFTPLKERPDLPPIQYEPVLCSRTTCRAVLNPLCQVDYRAKLW 
ACNFCYQRNQFPPSYAGISELNQPAELLPQFSSIEYWLRGPQM 
PLIFLYWDTCMEDEDLQALKESNQMSLSLLPPTALVGLITFGR 
MVQVHELGCEGISKSYVFRGTKDLSAKQLQEMLGLSKVPVTQAT 
RGPQVQQPPPSNRFLQPVQKIDMNLTDLLGELQRDPWPVPQGKR 
PLRSSGVALSIAVGLLECTFPNTGARIMMFIGGPATQGPGMWG 
DELKTPIRSWHDIDKDNAKYVKKGTKHFEALANRAATTGHVIDI 
YACALDQTGLLEMKCCPNLTGGYMVMGDSFNTSLFKQTFQRVFT 
KDMHGQFKMGFGGTLEIKTPR\EIKISGAIGPCVSLNSKGPCVS 
ENEIGTGGTCQWKICGLSPTTTLAIYFEWNQHWAPIPQGG\RG 
A\IQFVTQY\QHSSGQRRIRVTTIARN\WADAQTQIQNIAASFD 
QEAAAILMARLAIYRAETEEGPDVLRWLDRQLIRLCQKFGEYHK 
DDPSSFRFSETFSLYPQFMFHLRRSSFLQVFNNSPDESSYYRHH 
FMRQDLTQSL IMI QPILYAYS FSGPPE PVLLDS SSI LADRILLM 
DTFFQILIYHGETIAQWRKSGYQDMPEYENFRHLLQAPVDDAQE 
ILHSRFPMPRYIDTEHGGSQARFLLSKT/NPSQTHNNMYAWGQES 
GAP I LTDDVSLQVFMDHLKKLAVSSAA 



KVSEYI ILSKDKILFHALAMLVLWSPWSAARGVLRNYWERLLR 
KLPQSRPGFPSPPWGPALAVQ\AQPCLQSQQMIPVEVKRI/RSL 
LDSIFWMAAPKNRRT1EVNRCRRRNPQKLIKVKMNIDVCPECGH 
1KQKHVLCAYCYEKVCKETAEIRRQIGKQEGGPFKAPTIETWL 
YTGETPSEQDQGKRIIERDRKRPSWFTQN 



'LHGACTSLVLQRFCHRRPRPCAPARPEDMRRPAAVPLLLLLCF 
GSQRAKAATACGRPRMLNRMVGGQDTQEG3WPWQVSIQRNGSHF 
CGGSLIAEQWVLTAAHCFRNTSETSLYQVLLGARQLVQPGPHAM 
YARVRQVESNPLYQGTASSADVALVELEAPVPFTNYILPVCLPD 
PSVIFETGMNCWVTGWGSPSEEDLLPEPRILQKLAVPI IDT\ PR 
CNLLYSKDTEFGYQPKTIKNDMLCAC-FEEGKKDACKGDSAGPLV 
CLVGQSWLQAGVISWGEGCARQNRPGVYIRVTAHHNWIHRI I PK 
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SEQ 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

residue of 

sequence 


Predicted end 
nucleotide 

corresponding 

residue of 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=phenylalanine , G=Glycine, 
II=IIistidine, I = Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=poesible nucleotide insertion) 








LQVQPSEVGRPEVTPPGPGAP 




3 


1302 


GRPPTAPHTGRPPTANRGDPRLDLKRGCARLLTSIESRGRPAAS 
AGLRRDRCALRRWPLRRAPljARATRRRAGSPRRCAPRPRACPQG 
WSRARHQPGGLCLLLLLLCQFMEDRSACAGNCWLRQAIQNIGRCQV 
LYKTELSKEECCSTGRLSTSWTEEDVNENTLFKWMIFNGGAPNC 
IPCKETCE^DCGPGKKCRMNKKNKPRCVCAPDCSNITWKGPVC 
GLDGKTYRNECALLKARCKEQPELEVQYQGRCKXTCRDVFCPGS 
STCV \ VDQTNNAYCVTCNRI CPE PAS SEQYLCGNDGVTYS \ SAC 
HLRKATCLLGRS I GLAYEGKC I KAKS CEDIQCTGGKKCLWDFKV 
GRGRCSLCDELCPDSKSDEPVCASDNATYASECAMKEAACSSGV 
LLEVKHSGSCNSISEDTEEEEEDEDQDYSFPISSILEW 


6107 


S23 


1GB 


SRCSSPRPEPGRGRGK/LSPSEHRKWVEVFKACDEDHKGYLSRE 
DFKTAWMLFGYKPSKIBVDSVMSSINPNTSGILLEGFLNIVRK 
KKEAQRYRNEVRHIFTAFDTYYRGFLTLEDFKKAFRQVAPKLPE 
RTVLEVFREV\DRDS\DGHVSF 


6108 


3 


1348 


GC-SLRFSPERVFS.CSRVFL J VPiLL 7 ~l ', FQSPTTPW 
CLPRRYMKHKRDDGPEKQEDEAVDVTPVMTCVFWMCCSMLVLL 
YYFYDLLVYVVIGIFCLASATGLYSCLAPCVRRLP\SASAGESA 
LLAPTIPNNSLPYFHKRPQARMLL1ALFCVAVSWWGVFRNEDQ 
WAWVLQDALG IAFCLYMLKT I RLPTFKACTLLLLVLFLYD I FFV 
FITPFLTKSGSSIMVEVATGPSDSATREKLPMVLKVPRLNSSPL 
ALCDRPFSLLGFGDILVPGLLVAYCHRFDIQVQSSRVYFVACTT 
AYGVGLLVTFVALALMQRGQPALLYLVPCTLVTSCAVALWRREIj 
GVFWTGSGFAKVLPPSPWAPAPADGPQPPKDSATPLSPQPPSEE 
PATSPWPAEQSPKSKXSKEMGAGAPMREPGSPAESEGRDQAQPS 
PVTQPGASA 


6109 




1381 


CRSRAGAASGGAILEGTKLRRQRVDTNKPLDPLVPSALRAAMLY 
LEDYLEMIEQLPMDLRDRFTEMREMDLQVQNAMDQLEQRVSEFF 
MNAKIvNKPEWREEQMASIKKDYYKALEDADEKVQLANQIYDLVD 
RHLRKLDQELAKFKMELBADNAGITEILERRSLELDTPSQPVNN 
HHAHSHTPVEKRKYNPTSHHTTTDHIPEKKFKSEALLSTLTSDA 
SKENTLGCRNKMSTASSRNAYNVNSSQPLGSYNIGSLSSGTGAG 
GI \TMAAAQAVQATAQMKEGRRTS SLKAS YEAFKNND FQLGKEF 
SMARETVGYSSSSALMTTLTQNASSSAADSRSGRKSKNNNKSSS 
QQSSSSSSSSSLSSGSSSSTWQEISQQTTWPESDSNSQVDWT 
YDPNEPRYCICNQVSYGEMVGCDTQDCPIEWFHYGCVGLTEAPK 
GKWYCFQCT\AAMKRRGSRHK 


6110 


77 


2464 


ACPSAATMSDQDHSMDEMTAWKIEKGVGGNNGGNGNGGGAFSQ 
ARSSETGSSSSTGGGGQESQPSPr,AT,T,AATCSRIES?NENSNNS 
QGPSQSGGTGELDLTATQLSQGANGWQIISSSSGATPTSKEQSG 
SSTKC-SNGSESSKNRTVSGGQYWAAAPNLQNQQVLTGLPGVMP 
NI QYQVI PQF2TVDGQQLQFAATGAQVQQDGSGQ IQ I 1 PGANQQ 
I ITNRGSGGNI IAAMPNLLQQAVPLQGLANNVLSGQTQYVTNVP 
VALNGNITLLPVNSVSAATLTPSSQAVTIS3SGSQESGSQPVTS 
GTTISSASLVSSQASSSSFFTNANSYSTTTTTSNMGIMNFTTSG 
SSGTNSQGQTPQRVSGLQGSDALNIQQKQTSGGSLQAGQQKEGE 
Q\NQQTQAAPKSLSRPQLVQGG\QALQ\AFQAAPLSGQTFTTQA 
ISQETLQNLQLQAVPNSGPIIIRTPTVGPNGQVSWQTLQLQNLQ 
VQNPQAQTI TLAPMQGVSLGQTSS SNTTLTP I ASAAS I PAGTVT 
/NAAQLSS MPGLQTINLSALGTSG IQVHP I QGLPLAIANAPGDH 
GAQLGLHGAGGDGIHDDTAGGEEGENSPDAQPQAGRRTRREACT 
CPYCKDSEGRGSGDPGKKKQHICHIQGCGKVYGKTSHLRAHLRW 
HTGERPFMCTWSYCGKRFTRSDELQRHKRTHTGEKKFACPECPK 
RFMRSDHLSKHIKTHQNKKGGPGVALSVGTLPLDSGAGS3GSGT 
ATPSALITTNMVAMEAICPEGIARLANSGINVKEGGQFCSPINT 
SAKGF 
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SEQ 
NO: 


Predicted 
nucleotide 

to first 9 
amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L= Leucine, M=Methionine, N=Asparagine. , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


Sill 


1637 


797 


RVDPRVRGAMAPWGKRLAGVRGVLLDISGVLYDSGAGGGTAIAG 
SVEAVARLKRSRLKVRFCTOESQKSRAELVGQLQRLGFDISEQE 
VTAPAPAACQILKERGLRPYLLIHDGV\ASEFDQIETS/STPNC 
WIADAGESFSYQNMNNAFQVLMELEKPVLISLGKGRYYKETSG 
LMLD VGPYM KALE YACG 1 KAEVGG KP S PE FFKSALQAIGVEAHQ 
AVMIGDDIVGDVGGAQRCGMRALQVRTGKFRPSDEHHPEVKADG 
YVDNLAEAVDLLLQI1ADK 


6112 






KSSHKSFKSKRFLAKKQKPNRPILQWIWLKTGNKIRHNWK 


6113 


1779 


567 


1 3GI 3'JAACGVNLQGAWGERSGVRASEAESPGKRADVSWWSRQL 
ETMVDHLANTEINSQRIAAVESCFGASGQPLALPGRVLLGEGVL 
TKECRKKAKPRIFFLFNDILVYGSIVLNKRKYRSQHI1PLEEVT 
LELLPETLQAKNRWMI KTAKXS F WSAASATERQEWI SHIEECV 
RRQLRATGRPA\STEHAAPWIPDKATDICMRCTQTRFSALTRRH 
HCRKCRVWCAECSRQRFLLPRLSPKPVRVCSLCYRELAAQQRK 
EEAEEQGAGVPRAASHLARPICGRPVEMTMTPTRTRRAAGTATG 
PAAWSSTPRGWPGLPSTADPRPAEHLSPSQLHCPGPQEGSSRSC 
PGLRDPIPWWQVQRWGVALSGLPVPFCWTLCPYGFTAGNAFPFR 
KPQNTHRSW 


6114 


818 


246 


PTSRPRP3PGSPAMSWSACVSAAPSSSWPASSSWPCGPHRCCTR 
RRRCSPRCCLAAGSMCSCSPDWRCTPVPACWPSPPP\PAEQVQC 
GHLPPHADRRALRLPVAAPARGPGPGHPAGPAGPRPARTPPASP 
HGPGRPTVPAPPCPLLAATEPTPSRPHQRWTREDRMLGRGSQVT 
GRPQWFLRGLVLFSL 


6115 


324 


71 


DVCGRVCAHPHLYTH I HMH I CAHAC \ I HTHAQLC / I TASHALA1 1 
1 l TPSHTI PIirAVHKEHRADVLRGTLTPLR 


6116 


595 


1430 


tgvmppgrwhaa/isssgpvfegara\lqtvkkeeedesytpvq 

AARPQTLNRPGQELFRQLFRQLRYHESSGPLETLSRliRELCRWW 
LRPDVLSKAQILELLVLEQFLSILPGELRVWVQLHNPESGEE\L 
WPCWRS CRGTLHGHPGGTRALP \ EPRCALDGYRS \ LRSAQ1 WS L 
ASPLRSSSALGDHLEPPYEIEARDFLAGQSDTPAAQMPALFPRE 
GCPGDQVTPTRSLTAQLQETMTFKDVEVTFSQDEWGWLDSAQRN 
LYRDVMLENYRNMASLGK 


6117 


1433 


222 


VGVPSPAPPCSWEVGPGGGWTPGILKEGQGGRRTPLLLLATRTR 
GLLSLFPPAAMHPAAFPLPVWAAVLWGAAPTRGLIRATSDHNA 
SMDFADLPALFGATLSQE3LQGFLVEAHPDNACSPIAPPPPAPV 
NGSVFIALLRRFDCNFDLKVLNAQKAGYGAAWHKV1JSNELLNM 
VWNSEEIQQQIWIPSVFIGERSSEYLRALFVYEKGARVLLVPDN 
TFPLGYYLIPFTGIVGLLVLAMGAVMIARCIQHRKRLQRNRLTK 
\EQLKQI\PTHDYQKGDQYDVCAICLDEYEDGDKLRVLPCAHAY 
HSRCVDPWLTQTRKTCPICKQPVHRGPGDEDQEEETQGQEEGDE 
GEPRDHPASERTPLLGSSPTLPTSFGSLAPAPLVFPGPSTDPPL 
SPPSSPVILV 


6118 


1044 


247 


S T 1 3 CRACTSGATPGAQSHRSARGHAAGGKETAALGMERGKVKK 
E] E "ET2KEKIGEKGREEKVKRKEVEQKIKQEKQEKQERRKGK 
EKEEKRTKQGKETNKEKEQFKGQEEKGEKKDSTLTRTPLEPLEK 
NKQILVLGLDGAGKTSVLHSLASNRVQHSVAPTQGFHAVCINTE 
DSQMEFLEIGGSKPFRSYWEMYLSN/ADSLARSFSVGFKQDSQP 
ITWKAKKYLHQLIAANPVLPLWFANKQDLEAAYHITDIHEALA 
II 


6119 


1217 


462 


DPRFVTEKTTKAPAQBRTTQPRSSREGTLRSTMEYLSALIJPSDL 
LRSVSNISSEFGRRVWTSAPPPQRPFRVCDHKRTIRKGLTAATR 
QELLAKALETLLLNGVLTLVLEEDGTAVDSEDFFQLLEDDTCLM 
VLQSGQSWSPTRSGVLSYGLGRERPKHSKDIARFTFEVYKQNPR 
DLFGSLNVKATFYGLYSMSCDFQGL\GPKKVLRELLRWTSTLLQ 
GLGHMLLG I SSTLRHAVEGAEQWQQKGRLHS Y 
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ID 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

sequence 


corresponding 

amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Iieucine, M=Methionine, N=Asparaglne, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6120 




179 


LERAGGGGLSSRALVGSGACLSLVARANGKGLPRGRKEFVEAVR 
VRYVAFRYRTPPJWCLRLWSCRREVIMSGRGKQGGKVRAKAKSR 

ILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIAQG 
G\VLPNIQAVLLPKKTESQKDEGANDP 


6121 


1612 




FVRAQARGSRQPVRRPLLGAGSRLRCRSCGRMEPLKVEKFATAN 
RGNGLRAVTPLRPGELLFRSDPIAY'TVCiCGSRGWCDRCLLGKE 
KLMRCSQCRVAKYCSAKCQKKAWPDHKRECKCLKSCKPRYPPDS 
VRLLGRWFKLMDGAPSESEKLYSFYDLESNINKLTEDKKEGLR 
QLVMTFQHFMREEIQDASQLPPAFDLFEAFAKVICNS FTICNAE 
MQEVGVGL YPS I SLLNHS CDPNCS I VFNGPHLLLRAVRD IEVGE 
ELT I CYLDMLMTSEERRKQLRDQYCFECD \ CFRCQTQDKDADML 
TGDEQVWKEVQESLKKIEELKAHWKWEQVLAM^QAIISSNSERL 
PDINIYQLKVLDCAMDACINLGLLEBALFYGTRTMEPYRIFFPG 
SHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHGREH 
SLIEDLILLLE/AMRRQHQSILRERSQRBIRRVSLLNALLRSET 
LCFVS CVNLSYWKFCS VFV 




2 


2324 


RFRKMADGGAASQDESSAAAAAAADSKMNNPSETSKP3MESGEG 
TG : v'i'HC-LDFQKQPVPVGGAl STAQAQAFLGHLHQVQLAGTSL 
QAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQLMLA 

ggqitgltltpaqqqi,I)I j qqaqaqaql,IjAaavqqhsasqqhsaa 
gatisasaatpmtqiplsqpiqiaqdlqqlqqlq2qnlnlqqfv 
lvhpttnlqpa\qfiisqtpqgqqgllqa\qnlltqlprqsqan 
llqsqpri\tltsqpatptctiaatpiqtlpqsqstpkr:dtps 
leep\sdleeleqfaktfkqrriklgft\qgdaglamvklygnd 
fspttifrfealnlsfknmcklkpllekwlndaenlssdsslss 
psalnspgieglsrrrkkrtsiea\nirvaleksflei\\qkpts 
eeitmiadqlnmekgvirvwfcnrrqkekrinppssgg\tsssp 
ikaifpsptslvattpslvtssaattltvspvlpltsaavtnls 
vi bl sdttsnwtatvi stap passavts pslsps psasas tsea 
ssasetsttqttstplssplgtsqvmvtasglqta/aqllpfkg 
aaqlpanasiaamaaaaglnpslmapsqfaaggallslnpgtls 
galspalmsnstlatiqalasggslpitsldatgnlvfanagga 
pnivtaplflnpqnlslltsnpvslvsaaaasagnsapvaslha 
tstsaesiqnslftvasasgaasttttaskaq 


6123 


3 


2944 


HLijHRWFGTDMQMINFTTGEFQLTEACPYLGTHSEESRFGILHL 
HLQPLEMKRVGWFTPADYGKVTSLILIRNNLTVIDMIGVEGFG 
ARELLKVGGRLPGAGGSLRFKVPESTLMDCRRQLKDSKQILSIT 
KNFKVEWIGPLPITVSSLKINGYNCQGYGFEVXiDCHQFSLDPNT 
SRDISIVFTPDFTSSWVIRDLSLVTAADLEFRFTLNVTLPHHLL 
PLCADWPGPSWEESFWRLTVFFVSLSLLGVILIAFQQAQYILM 
E7MKTRQRQNASSSSQQNNGPMDVISPHSYKSNCKNFLDTYGPS 
DKGRGKNCLPVNTPQSRIQNAAKRSPATYGHSQKKHKCSVYYSK 
HXTSTAAASSTSTTTEEKQTSPLGSSLPAAKEDICTDAMRENWI 
SLRYASGINVNLQKNLTLPKNIjLNKEESrTLKNTIVFSMPSSECS 
MKEGIQTCMFPKETDIKTSENTAEFKERELCPLKTSKKLPENHL 
PRNSPQYHQPDLPEISRKNNGNNQQVPVKNEVDHCENLKKVDTK 
PSSEKI IHKTSREDMFSEKQDIPFVEQEDPYRKKKLQEKREGNL 
QNLNWSKSRTCRKNKKRGVAPVSRPPEQSDLKLVCSDFERSELS 
SDINVRSWCIQESTREVCKADAEIASSLPAAQREAEGYYQKPEK 
KCVDKFCSDSSSDCGSSSGSVRASRGSWGSWSSTSSSDGDKKPM 
VDAQHFLPAGDSVSQWDFPSEAPISLNLSHNICNPMTGNSLPQY 
AEPSCPSLPAGPTGVEEDKGLYSPGDLWPTPPVCVTSSLNCTLE 
NGVPCVIQESAPVHNSFIDWSATCEGQFSSAYCPLELNDYNAFP 
E ENMN YANGFP CPADVQTDF I DHWS QS TWNTP P\NMPAS \ WGNA 
GFP3SSKPYLKSTPKACLPMSGLFGPl\WAP\QSDVYENCCPIN 
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Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=PhenylaIanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
Ii=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion 
\=possible nucleotide insertion) 








pttehsd/thmenqa\wckeyypgfVnpfraymnldiwttt\a 
nrnanfplsrdssycgnv 


6124 


1573 


236 


SDEALRLAGERGMGRVQLFEISLSHGRWYSPGEPLAGTVRVRI, 
GAPLPFRAIRVTCIGSCGVSNKANDTAWWEEGYFNSSLSLADK 
GSLPAGEHSFPFQFLLPATAPTSFEGPFGKIVHQVRA*\IHTPRF 
SKDHKCSLVFYILSPLNLNSIPDIEQPNVAS^TKKFSYKLVKTG 
SWLTASTDLRGYWGQALQLHADVEIIQSGKDTSPWASLLQKV 
SYKAKRWIHDVRTIAEVEGAGVKAWRRAQWHEQILVPALPQSAL 
PGCSLIHIDYYLQVSLKAPEATVTLPVFIGNIAV/ITPCPSEPPA 
RPGAASWGPTPGG\PSAPPQEEAEAEAAAGGPHFLDPVFLSTKS 
HSQRQPLLATLSSVPGAPEPCPQDGSPASHPLHPPLCISTGATV 
PYFAEGSGGPVPTTSTLILPPEYSSWGYPYEAPPSYEQSCGGVE 
PSLTPES 


6125 




904 


KTCPKLTCAFTVSVPDSCCRVCRGDGELSWEHSDGDIFRQFAMR 
EARHSYHRSHYDPPPSRQAGGLSRFPGARSHRGALMDSQQASGT 
IVQIVINNKHKHGQVCVSNGKTYSHGESWHPNLRAFGIVECVLC 
TCNVTKQECKKIHCPNRYPCKYPQKIDGKCCKVCPG/KKAKEEL 
PGQSF2KKGYFCGEETMPVYESVFMEDGETTRKIALETERPPQV 
EVHVWTIRKGILQHFHIEKISKRMFEELPHFKLVTRTTLSQWKI 
FTEG2AQISQMCSSRVCRTELEDLVKVLYLERSERGHC 


612S 


1224 




RLLSEAPCPRSRRRFQMNPEWGQAFVHVAVAGGLCAVAVFTG1F 
DSVSVQVGYEHYAEAPVAGLPAFLAMPFNSI.VNMAYTLLGLSKL 
KRGGMGLGPRYLKDVFAAMALLxG, '11 1 PRAAVLDQ 
WLTLPIFAWPVAWCLYLDRGWRP\WLFLSLECVSLASYGLALLH 
PQGFEVALGAHWPAVGQALRT\HRHYG/SAT?SATYLALGVLS 
CLGFWLKLCDHQLARWRLFQCLTGHFWSKVCDVLQFHFAFLFL 
THFNTHPR FHP S GG KTR 


6127 


13 3 5 


4S3 


VLPRRCLVFWNTMDSSR3PTLGRLDAAGFWQVWQRFDADEKGY 
IEEKELDAFFLHMLMKLGTDDTVMICANLHKVKQQFMTTQDA3KD 
GRIRMKELAGMFLSEDENFLLLFRRENPLDSSVEFMQIWRKYDA 
DSSGFISAAELRNFLRDLFLHHKKAISEAKLEEYTGTMMKIFDR 
NKDGRLDLNDLARILALQENFLLQFKMDACSTEKRKGDFEKIFA 
YYDVSKTGALEGP \EVDGFVKDMMELVQPS I SGVDLDKFRE ILL 
RHCDVNKDGKIQKSELALCLGLKINP 


612R 


2511 


843 


TCRMSRRQLERWVWSSQQVQARGRKVRAPRLGKIAMGLEMSSKD 
SPGSLDGRAWEDAQKPQSAWCGGRKTRVYATSSRRAPPSEGTRR 
GGAARPEKTAEEGPPAAPGSLRHSGPLGPHACPTALPEPQVTSA 
MSSQWGIEPLYIKAEPASPDSPKGSS3TETEPPVALAPG\PAP 
TRCLPGHKEEEDGEGAGPGEQGGGKLVLSSLPKRLCLVCGDVAS 
GYHYGVASCEACKAFFKRTI QGS IEYS CPASNECE I TKRRRKAC 

FPAGPLAVAGGPRKTAAPVNALVSHLLWEPEKLYAMPDPAGPD 
GHLPAVATLCDLFDREIWTISWAKSIPGFSSLSLSDQMSVLQS 
VWMEVLVLGVAORSLTLQDELAFAEYLVLDEEGARPAGLGELG\ 
AALLQLVRRLQALRLEREEYVLLKALALANSDSVHIEDEPRLWS 
- EI I HE ^LLE YEAGRAGPGGGAERRRAGRLLLTLPLLRQTAG 
KVLAHFYGVKLEGKVPMHKLFLEMLEAMMD 


6129 


1764 


771 


ARFARSAHEGKMPKKKTGARKKAENRREREKQLRASRSTIDLAK 

l?c:r smecdkcqrrqknrafcyfcnsvqklpicaqcgktkcmm 
kssdcvi khagvystglamvgai cdfceawvchgrkclsthaca 
cpltdaec\vecergvwdhggrifscsfchnflceddqfehqas 
cqvleaetfkcvscnrlgqhsclrckacfcddhtrskvfkqekg 
:ghetqetkdlsmstrslkfgrqtggeegdgasgyda 
yhfxr.ssdkygdtsyhdeeedeyeaeddeeeedegrkdsdtess 
dlftnlnlgrtyasgyahyeeqen 


6130 


3 


577 


GRGGTMREYKVWLGSG\GVGKSALTV\QFVTCTFIEKYDPTIE 
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beginning 
nucleotide 
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Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, 0=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
DFYRKEIEV\DSSPSVAGISWTQQGTEQF\ASMRDLYIKKGQGC 
ILVYSLVNQQSFQ\DIKPMRDQIIRVKVSEKVPVI\LVGN\SVD 
LESEREVSSSEGRALAEEWGCPFMETSAKSKTMVDELFAE1VRQ 
MNYAAQPDKDDPCCSACNIQ 



SSPREKTSDSSHRPSRHGFLFLRLVGLSPFSYLCVPPSRPVPGS 
PRSLSAMRLLP1APGRLRRGSPRHLPSCSPALLLLVLGGCLGVF 
GVAAGTRRPNWLLLTDDQDEVLGGMTPLKKTKALIGEMGMTFS 
SAYVPSALCCPSRASILTGKYPHNHHWMNTLEGNCSSKSWQKI 
QEPNTFPAILRSMCGYQTFF\AGKYLNEYGAPDAGGLEHVPLGW 
SYWYALEKNSKYYNYTLSINGKARKHGENYSVDYLTDVLANVSL 
DFLDYKSNFE PFFMMTATP \APHS PWTAAPQYQKAFQNVFAPRN 
KWFNIHGTNKHWLIRQAKTPMTNSSICFLDNAFRKRWQTLLSVD 
DLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQFSLPIDKRQLY 
EFDIKVPLLVRGPGIKPNQTSKMLVANIDLGPTILDIAGYDLNK 
TQMDGMSLLPILRGASNLTWRSDVLVEYQGEGRNVTDPTCPSLS 
PGVSQCFPDCVCEDAYNNTYACVRTMSALWHLQYCEFDDQEVFV 
EVYNLTADPDQI TNI AKTTDPELLG K n - DM 'Lit 3CSGPTCRT 
PGVFDPGYRFDPRLMFSHRGSVRTRRFSKH LL 

jLPPGLVPEDPRRTRNLLPFGIQGPEFALSRPLFSCVESGW 
AWEAMEPEFLYDLLQLPKGVEPPAE3ELSKGGKKKYLPPTSRKD 
PKFEELQKPA\VLMEWINATLLPEHIWRSLEEDMFDGLILHHL 
FQRLAALKLEAEDIALTATSQKHKLTWLEAWR3\CSWRSGRP 
SGA/WESIFNKDLLSTLHLLVALAKRFQPDLSLPTNVQVEVIT1 
ESTKSGLKSEKLVEQLTEYSTDKDEPPKDVFDELFKLAPEKVNA 
VKEAIVNFVWQKLDRLGLSVQNLDTQFADGVILLLLIGQLEGFF 
LHLKEFYLTPNSPAEMLHNVTLALELL/IGRGPAQLPC/LALK/ 
TIVMKDAKSTLRVLYGLFCKHTQKAHRDRTPHGAPN 
FVHGSMADTDLFMECEEEELEPWQKISDVIEDSWEDYNSVDKT 
TTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGAQNSDSTK 
KTLVTL I ANNNAGNPLVQQGGQPL ILTQNPAPGLGTMVTQPVLR 
PVQVMQNANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVG 
IVLNVQQGQTVRPITLVPAPGTQFVKPTV3VPQVFSQMTPVRPG 
STMPVRPTTNTFTTVIPATLTIRSTVPQSQSQQTKSTPSTSTTP 
TATQPTSLGQLAVQSPGQSNQTTNPKLAPSFPSPFAVSIASFVT 
VKRPGVTGENSNEVAKLVNTLNTIPS LGQS PGPVWSNNS SAH\ 
G3QRTSGPESSMKVTSSIPVFDLQDGGRKICPRCNAQFRVTEAL 
RGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEKTAPVAS 
/THPSSTP1PALSPPY/TKVPEPNENVGDAVQTKLIMLVUDFYY 
GRDGGKVAQLTNFPKVATS FRCPHCTKRLKNNIRFMNHMKHHVE 
LDQQNGEVDGHTICQHCYRQFSTPFQLQCHLENVHSPYESTTKC 
KICEWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVD 
VHFRMIHEDTRHLLCPYCLKVFKNGKAFQQHYMRHQKR\NVYH\ 
CNKCRVQFLFAKDKIEHKLQHHKTFRKPKQLEGLKPGTKVTIRA 
SRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVFLYPPVQRS 
IQKRAVRKMSVMGRQTCLECSFEIPDFPNHFPTYVHCSLCRYST 
CCSRAYANHMINNHVPRKSPKYLALFKNSVSGIKLACTSCTFVT 
SVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSRHGQTRDRVHDR 
NVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPALPSPA 
STATPPPTPTHPQALALPPLATEGAECLNVDDQDEGSPVTQEPE 
LASGGGGSGGVGKKEQLSVKKLRWLFALCCNTEQAAEHFRNPQ 
RRIRRWLRRFQASQGENLEGKYLSFEAEEKLAEWLTQREQQLP 
VNEETLFQKATKIGRSLEGGFKISYEWAVRFMLRHHLTPHARRA 
VAHTLPKDVAENAGLFIDFVQRQIHNQDLPLSMIVAIDEISLFL 
DTEVLSSDDRKENALQTVGTGEPWCDWLAILADGTVLPTLVFY 
RGQMDQPANMPDSILLEAKESGYSDDEIMELWSTRVKQKHTACQ 
RSKGMLVMDCHRTHLSEEVLAMLSASSTLPAWPAGCSSKIQPL 
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Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid", E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DVCI KRTVKNFLHKKWKEQAREMADTACDSDVLLQLVLVWLGEV " 
LGVIGDCPELVQRSFLVASVLPGPDGNINSPTRNADMQEELIAS 
LEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLFEGESETES 
FYGFEEADLDLMEI 


S134 


2 


425S 


fvhgsmadtdlfmeceee3lepwqkisdviedswedynsvdkt 
ttvsvsqqpvsapvpiaahasvaghlststtvsssgaqnsdstk 
ktlvtliannnagnplvqqggqplilt3npapglgtmvtqpvlr 
pvqvmqnanhvtsspvasqpifittqgfpvrnvrpvqnamnqvg 
ivlnvqqgqtvrpitlvpapgtqfvkptvgvpqvfsqmtpvrpg 
stmpvrpttntfttvipatltirstvpqsqsqqtkstpststtp 
tatqptslgqlavqspgqsnqttnpklapsfpsppavs iaspvt 
vkrpgvtgensnevaklvntlntipslgqspgpwvsnnssah\ 
gsqrtsgpessmkvtssipvfdlqdggrkicprcnaqfrvteal 
rghmcyccpemveyqkkgksldsepsvpsaakppspektapvas 
/thpsstpipalsppy/tkvpepnenvgdavqtklimlvddfyy 
grdggkvaqltnfpkvatsfrcpuctkrlknn.tr7mnhmkhhvk 
ldqqngevdghticqhcyrqfstpfqlqchlenvhspyesttkc 
kicewafeseplflqhmkdthkpgempyvcqvcqyrsslysevd 
vhprmihedtrhllcpyclkvfkngkafqqhymrhqkr\nvyh\ 
cnkcrvqflfakdkiehklqhhktfrkpkqleglkpgtkvtira 
srgqprtvpvssnutppsalqeaapltssmdplpvplyppvqrs 
iqkravrkmsvmgrqtclecsfeipdfpnhfptyvhcslcryst 

svgdamakhlvfnpshrsssilprgltwiahsrhgq'l'rdrvhdr 
nvknmypppsfptnkaatvksagatpaepeelltplapalpspa 
statppptpthpqalalpplategaeclnvddqdegspvtqepe 
lasggggsggvgkkeqlsvkklrwlfalccnteqaaehfrnpq 
rrirrwlrrfqasqgenlegkylsfeaeeklaewvltqreqqlp 
vneetlfqkatktgrsleggfkisyewavrfmlrhhltpharra 
vahtlpkdvaenaglfidfvqrqihnqdlplsmivaideislfl 
dtevlssddrkenalqtvgtgepwcdwlailadgtvlptlvfy 
rgqmdqpanmpdsilleakesgysddeimelwstrvwqkhtacq 
rskgmlvmdchrthlseevlamlbarstlpawpagcsskiqpl 
dvcikrtvknflhkkwkeqaremadtacdsdvllqlvlvwlgev 
lgvigdcpelvqrsflvasvlpgpdgninsptrnadmqeelias 
leeqlklsgehsesstprprsspeetiepeslhqlfeghsetes 
fygfeeadldlmei 


6135 


2 




FVH3SMADTDLFMECEEEELEPWQKISDVIEDSWEDYNSVDKT 
TTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGAQNSDSTK 
KTLVTLIANWNAGKPLVQQGGQPLILTQNPAPGLGTMVTQPVLR 
PVQVMQWANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVG 
IVLNVQQGQTVRPITLVPAPGTQPVKPTVGVPQVFSQMTPVRPG 
STMPVRPTTNTFTTVIPATLTIRSTVPQSQSQQTKSTPSTSTTP 
TATQPTS LGQLAVQS PGQSNQTTKTPKLAPS FPSPPAVS IAS FVT 
VKRPGVTGENSNEVAKLVNTLNTIPSLGQSPGPWVSNNSSAH\ 
GSQRTSG PES SMKVTSS I PVFDLQDGGRKI CPRCNAQFRVTEAL 
RGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEKTAPVAS 
/THPSSTPIPALSPPY/TKVPEPWENVGDAVQTKLIMLVDDFYY 
GRDGGKVAQLTHFPKVATSFRCPHCTKRLKNNIRFMNHMKHHVE 
LDQQNGEVDGHTICQHCYRQFSTPFQLQCHLENVHSPYESTTKC 
KICEWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVD 
VHFRMIHEDTRHLLCPYCLKVFKNGNAFQQHYMRHQKR\NVYH\ 
CNKCRVQFLFAKDKIEHKLQHHKTFRKPKQLEGLKPGTKVTIRA 
SRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVFLYPPVQRS 
IQKRAVRKMS VMGRQTCLECS FE I PDFPNHFPTYVHCSLCRYST 
CCSRAYANHMINNEVPRKSPKYLALFKNSVSGIKLACTSCTFVT 
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Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSRHGQTRBRVHDR 
NVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPALPSPA 
STATPPPTPTHPQALALPPLATEGAECLNVDDQDEGSPVTQEPE 
IASG3GGSGGVGKKEQLSVKKLRWLFALCCNTEQAAEHFRNPQ 
RRIRRWLRRFQASQGENLEGKYLSFEAEEKLAEWVLTQREQQLP 
VNEETLFQKATKIGRSLEGC-FKISYEWAVRFMLRHHLTPHARRA 
VAHTLPKDVAENACLFIDFVQRQIHNQDLPLSMIVAIDEISLFL 
DTEVLSSDDRKENALQTVGTGEPWCDWLAILADGTVLPTLVFY 
RGQMDQPANMPDS I LLEAKESGYS DDE IMELWS TRVWQKHTACQ 
RSKGMLVMDCHRTHLSEEVLAMLSASSTLPAWPAGCSSK1QPL 
DVCIKRTVKKFLHKKWKEQAREMADTACDSDVLLQLVLVWLGEV 
LGVIGDCPELVQRSFLVASVLPGPDGNINSPTRNADMQEELIAS 
LEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLFEGESETES 
FYGF2EADLDLMEI 


6 13 fa 


1704 


539 


FGVRMALEGMSKRKRKRSVQEGLWDDGVRGSPPEDYRLGQVAS 
SLFRGEHHSRGGTGRLASLFSSLEPQIQPVYVPVPK\ESALASA 
DLSEEIHQKQGOKRKNSOPGVKVADRKILDDTEDTWSQRKKIQ 
INQEEERLKNERTVFVGHLPVTCNKKKLKSFFKEYGQIESVRFR 
SLIPAEGTLSKKT.AAIKRKIHPDQKNIMAYVVFKEESAATQALK 
RNGAQIADGFRIRVDLASETSSRDKRSVFVGNLPYKVEESAIEK 
HFLDCGSIMAVRIVRDKMTGIGKGFGYVLFENTDSVHLALKLMN" 
SELMGRKLRVMRSVNKEKFKQQNEMPRLKNVSKPKQGLNFTSKT 
AEGHPKSLFIGEKAVLT.KTKKKGQKKSGRPKKQRKQK 


6137 


141 


265S 


1 - j P-RRPAPI 

MWNMLIVAMCLA\LLGLPGKAQELQGHVS\1ILAGEQLGDLAKK 

YLWQG\LFQLYLDEAGRGHSFSFHGAALTAFKQGQELMAKALES 

LSCPKDMAPSHCAEHKDQFLQLSQYRQLKTAEDYQALNKDIEAQ 

LQHAGLREAGG I FYFS VP P FAYED IARNINSSCRPGPGAWLRW 

LEKPFGHDHFSAQQLATELGTFFQEEEMYRVDIIYLGKQAVAQIL 

PFRDQNRKALDGLWNRHHVERVEI 1 MKETVDAEGRTS FYEEYGV 

IRDVLQNHLTEVLTLVAMELPHNVSSAEAVLRHKLQVFQALRGL 

QRGSAWGQYQSYSEQVRRELQKPDSFHSLTPTFAGVLVHIDNL 

RWEGVPFILMSGKALDERVGYARILFK2JQACCVQSEKHWAAAQS 

QCLPRQLVFHIGHGDLGSPAVLVSRNLFRPSLPSSWXEMEGPPG 

LRLFGSPLSDYYAYSPVRERDAHSVLLSHIFHGRKNFFITTENL 

LASWNFWTPLLESLAHKAPRLYPGGAENGRLLDFEFSSGRLFFS 

QQQPEQLVPGPGPGPMPSDFQVLRAKYRESSLVSAWSEELISKL 

j Jtfl i£7'.TAVRAVRRFGQFIILALSGGESPVALFQQIiATAHYGFPW 

AHTHLWLVDERCVPLSDPEStIFQGLQAHLLQHVRIPYYNIH\AM 

i'V.ii/.ORLCAEEDQGAHIYAREISALGANSSFDLVLLGMGADGH 

TASLFPQSPTGLDGEQLWLTTSPSQPHRRMSLELPLINRAKKV 

AVLW.GRMKREITTLVSRVGHEPKKWPISGVLPHSGQLVWYMDY 

DAFLG 


6138 


4587 


934 


EFSKLTDRWQNAVQGVRQRKGDVDGLVRQWQDFTTSVENLFRFL 
TD TSHLLSAVKGQER FSLYQTRSL IHELKNKE IHFQRRRTTCAL 
TLEAGEICLIjLTTDLKTKESVGRRISQLQDSWKDMEPQLAEMIKQ 
FQSTVETWDQCEKKIKELKSRLQVLKAQSEDPLPELHEDLHNEK 
ELIKELEOSLASWTQWLKELQTMKADI.TRHVIjVEDVMVLKEQIE 
HLHE JKEDLCLRVAIRKQEIEDRLNTWWFNEKNKELCAWLVQM 
E 11 /'jCTADISIEEMIEKLQKDCMEEINLFSENKLQLKQMGDQL 
IKASNKSRAAEIDDKLNKINDRWQHLFDVIG5RVKKLKETFAFI 
QQLDKNMSNLRTWLARIESELSKPWYDVCDDQEIQKRLAEQQD 
LQRDIEQHSAGVESVFNICDVLLHDSDACANETECDSIQQTTRS 
LDRRWRNICAMSMERRMKIEETWRLWQKFLDDYSRFEDWLKSAE 
RTAACPNSSEVLYTSAKEELKRFEAFQRQIHERLTQLELINKQY 
RRLARENRTDTASRLKQMVHEGNQRWDNLQRRVTAVLRRLRHFT 
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SEQ 
ID 
NO: 


Predicted 
beginning 

corresponding 
to first 
amino acid 

sequence 


Predicted end 
nucleotide 

amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








FQQEITLNTNKIDQLIVFGEQLIQKSEP\LDAVLIEDELEELHR 
YCQEVFGRVSRFHRRLTSCTPGLEDEKEASENETDMEDPREIQT 
DSWRKRGESEEPSSPQSLCHLVAPGHERSGCETPVSVDS\IPLE 
WDHTGRRGGPSSSH\EEDEEAQYY\SALSGKSISDGHSWHVPDS 
PSCPEHHYKQMEGDRNVPPVPPASSTPYKPPYGKLLLPPGTDGG 
KEGPRVLNGNPQQEDGGLAG I TEQQSGAFDRWEM IQAQEL \HNK 
LKIKQNLQQLNSDISAITTWLKKTEAELEMLKMAKPPSDIQE1E 
LRVKRLQEILKAFDTYKALWSVNVSSKEFLQTESPESTELQSR 

SAKNRRQKAHVTDPKADPRALLECRRELMQLEKELVERQPQVDM 
LQEISNSLLIKGHGEDCIEAEEKVHVI\EKKLKCLREQVSQDLM 
ALQGTQNPASPLPSFDEVDSGDQPPATSVPAPRAKQFRAVRTTE 
GEEETESRVPGSTRPQRSFLSRWRAALPLQLLLLLLLLLACLL 
PSSEEDYSCTQANNF\ARSFYPMLRYTNGPPPT 


5139 


52 


1131 


L ETPiaVLl RLREGE 
TERRPWBASSWKTL/LAGWIGGAASVIVGHPLDTVKTRLQAGVG 

TQRFLSQHRCGEPEASPPRTLSDLLLASMVAGWSVGLGGPVDL 
IKIRLQMQTPPVSGRQPRFEVQGSGSCG\EPAYQGPVHCITTIV 
RNEGIAGLYRGASAMLLRDVPGYCLYF I P YVFLSEW ITPEACTG 
i S PC -^WJJVGGMAGAISWGTATPMDWKSRLQADGVYI.NKYKGV 

IRGDHAVTSP 


G140 


694 


136 


RPELELWRLRSRSWKP1.GVPRKCHRRNWKEPVRAQPLSVTVWAP 
RCQRP/QPPAPEPSSPNAAVPEAIPTPRAAASAALELPLGPAPV 
SVAPQAEAEARSTPGPAGSRLGPETFRQRFRQFRYQDAAGPREA 
FRQLREL/SPRQWLRPDI \RTKEQ\ IVEMLVQEQLIAILPEAAR 
ARRIRRRTDVRITG 


5141 


2 


984 


AQVGPRSRPCKMPLKLRGKKKAKSKETAGLVEGEPTGAGGGSLS 
ASRAPARRLVFHAQLAHGSATGRVEGFSSIQELYAQIAGAFEIS 
PSE I LYCTLNTP KIDMERLLGGQLGLEDFI FAHVXGIE KE VNVY 
KSEDSLGLTITDNGVGYAFIKRIKDGGVIDSVKTICVGDHIESI 
NGENIVGWRHYDVAKKLKELKKEELFTMKLIEPKKAFEIELRSK 
AGKSSGEKIGCGPJYILRLRSKGPATVEEMPSETKAK\AIEKIDD 
VLELYMGIRDIDLATTMFEAGKDKVNPDEFAVALDETLGDFAFP 
DEFVFDVWGVIGDAKRRGL 


6142 


US 


602 


EAEGEQVCGAKCCGDAPHVENREEETAR I GPGVMES KEERALNN 
LIVENVWQENDEKDEKEQVANKGEPLALPLNVSEYCVPRGNRRR 
FRVKQP I LQYRWD I MHRLGE PQARMREENMERIGEE VRQLMEKL 
REKQLSHSLRAVSTDPPHHDHHDEFC\LMP 


6143 


2802 


270 


frmriflhcpwnqqmwkiwnlletslesckahlsiq;<llker\q 
\qlpvfkhrdsivetlkrhrwwaget\gsgkstqvphflled 
lllneweaskcnivctqprrisavslanrvcdelgcengpggrn 
slcgyoirmesracestrllycttgvllrklqedgllsnvs/hm 
fivdev\her\svqsdflliilkeilqkrsdlhlilmsatvdse 
kfstyfthcpilrxsgrsypvevfhlediieetgfvlekdseyc 
qkfleeeeevtinvtskaggikkyqeyipvqtgahadlnpfyqk 
ysertqhailymnphkinldlilellayldkspqfrniegavli 
jl?qi;ahiqqlydllsndrrfyserykvialhsilstqdqaaaf 
tlp p pgvrki vlatniaetg it i pdwfvidtgrtkewkyhes s 
qmhslvetfvskasalqrqgragrvhdgfcfrmytrerfegfmd 
ysvpe i lrvpleelclhimkcnlgs pedfls kaldppqlqvi sn 
s ini l^kigacelnepkltplgqhlaalpvnvkigkmlifgaif 
gcldpvatlaavmtekspfttpigrkdeadlaksalamadsdhl 
ti ynaylgwkkarqeggyrse i tycrrnflnrtslltledvkqe 
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>idue of 
sequence 



Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G=Glycine 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=St 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
LIKLVKAAGFSSSTTSTSWEGHRASQTLSFQEIALLKAVLVAGL 
YDNVGKI IYTKSVDVTEKLACIVETAQGKAQVHPSSVNRDLQTH 
GWLLYQEKIRYARVYLRETTLITPFPVLLFGGDIEVQHRERLLS 
IDGWIYFQAPVKIAVIFKQLRVLIDSVLRKKLENPKMSLENDKI 

LQIITELIKTENN 

SGPGSMSGQRVDVKVVMLGKEYVGKTSLVERYVHDRFLVGPYQN 
VSASGGARHGGRGSGGPVTCTYGPDLFPLVA\riGAAFVAKVMS 
VGDRTVTLGIWDTAGSERYEAMSRIYYRGAKAAIVCYDLTDSSS 
FERAKFWVKELRSLEEGCQIYLCGTKSDLLEEDRRRRRVDFHDV 
QDYADNIKAQLFETSSKTGQSVDELFCKVAEDYVSVAAFQVMTE 

_ DKGVDLGQKPNPYFYSCCHH 

GGMDLSELERDNTGRCRLSSPVPAVCRKEPCVLGVDEAGRGPVL 
GPMVYAICYCPLPRLADLEALKVADSKTLLES3RERLFAKMEDT 
DFVGWALDVLSPNL1STSMLGRVKYNLNSLSHDTATGLIQYALD 
QGVNVTQVFVDTVGMPETYQARLQQ3FPGIEVTVKAKADALYPV 
\VSAASICAKVARDQAVKKWQFVEKLQDLDTDYG\SGYPNDPQD 
/TKAWLKEHVEPVF\GFP\QFVRF\SWRTAQTI\LEKEAEDVIR 
EDSASEWQEGLRKITSYFLMEGSQARPRESI1RYFLERGLESTTS 

L 

~ -~ i '--EPAEAQQVEALPGPSLDQWHRSAGEEEDGPVLTDEQKS 
R/YPQHEAHDQGG\MDARQSIIRKWDPETGRTRLIKGDGEVLE 
EIVTKKRHREINKQATRGDCIiAFOMRAGLLP 
GTRQL PP P S PGSGPGDS PEGPEGEAPERRRKAHGMLKLYYGLSE 
GEAAGRPAGPDPLDPTDLNGAHFDPEVYLDKLRRECPLAQLMDS 
ETDMVRQIRALDSDMQTLVYENYNKFISATDTIRKMKNDFRKME 
DEMDRLATNMAVITDFSARISATLQDRHERITKLAGVHALLRKL 
QFLFELPSRLTKCVELGAYGQAVRYQGRAQAVLQQYQKLPSFRA 
IQDDCQVITARLAQQLRQRFREGGSGAPEOAECVELLLALGEPA 
EELCEEFLAHARGRLEKELRNLEAELGPSPPAPDVLEFTDHG\S 
SGFVGGLCQVAAAYQELFAAQGPAGAEKLAAFARQL3SRYFALV 
ERRLAQEQGGGDNSLLVRALDRFHRRLRAPGALLAAAGLADAAT 
EIVERVARERLGHHLQGLRAAFLGCLTDVRQALAAPRVAGKEGP 
GLAELLANVASSILSHIKASLAAVHLFTAKEVSFSNKPYFRGEF 
CSQGVREGLIVGFVHSMCQTAQSFCDSPGEKGGATPPALLLLIiS 
RLCLDYETATI S YI LTLTDEQFLVQDQ FPVTPVS TLCAEARETA 
RRLLTHYVKVQGLVISQMLRKSVETRDWLSTLEPRWVRAVMKKV 
VEDTTAIDVQVLPRLAGVALTQAGGTVPSRGAGAAEDHWQSLPG 
GGDMCIWASHGASSVARASVREPQGNKSPRMNTKRAGECLCPRS 
CSFSAQDYDIFAPILPVEKQRLRVTQEVRAGLVLVLKIRPQTNS 
CILPLPHSTGSINSDHVPTK 
VPAVGGTFADGAMGEAEKFHYI YSCDLDINVQLKIGSLEGKREQ 
KSYKAVLEDPMLKFSGLYQETCSDLYVTCQVFAEGKPLALPVRT 
SYKAFSTRWNWNEWLKLPVKYPDLPRNAQVALjTIWDVYGPGEAV 
PVGGTTVSLFGKYGMFRQGMHDLKVWP1ICRSQKDQKPTKTPGRT 
SSTLSEDQMSRLAKLTKAKRQGHMVKVDWLDRLTFREIEMINES 
VKRSSNFMYLMGGFRCVKCDDKEYGIVYYEKDGDESSPILTSFE 
LVKVPDPQMSLENLVESKHHNLPRSLRSGPSDHDLKPYPSPRDQ 
LKNIVSYPPSKPPTYEEQDLVWEFRYYLTNQDKALTKILTSVIW 
DLPQGAKQALALLGKWKPMDVEDSLELLSSHYTNPTVRRYAVAR 
LRQADDEDLLMYLLQLVQALKYENFDDIKNGLEPTKKDSQSSVS 
ENVSNSGINSAEIDSSQIIT/SAPFPSVSSPPP\ASKTKEVPDG 
E ] l^CLCTFLISRASKNSTLAKYLYWYVIVECEDQDTQQRDPK 
T T i£"-1YLII\ r MRRFSQALLKGDKSVRVMRS LLAAQQTFVDRLVHLM 
KAVQRESGNRKKKNERLQALLGDNEKMNLSDVELIPLPLEPQVK 
IRGIIPETATLFKSALMPAQLFFKTEDGGKYPVIFKHGDDLRQD 
QLILQIISLMDKLLRKENLDLKLTPYKVIATSTKHGFMQFIQSV 
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corresponding 



Predicted end 



isidue c 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leueine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, K=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyro E ine, X=Unknovm, *=Stop 
Codon, /=poesible nucleotide deletion, 
\=possible nucleotide insertion) 
PVAEVLDTEGSIQNFFRKYAPSENGPNGISAEVHDTYVKSCAGY 
CVITYILGVGDRHLDNLLLTKTGKLFHIDFGYILGRDPKPLPPP 
MKLNKEMVEGMGGTQSEQYQEFRKQCYTAFLHLRRYSNLILNLF 
SLMVDANIPDIALEPDKTVKKVQDKFRLDLSDEEAVHYMQSLID 
ESVHALFAAWEQIHKFAQYWRK 



RVDPRVRENGTANPIKNGKTSPASKDQRTGKKTSVQGQVQKGND 
ESESDFESDPPSPKSSEEEEQDDEEVLQGEQGDFNDDDTEPENL 
GHRPLLMDSEDEEEEEKHSSDSDYEQAKAKYSDMSSVYRDRSGS 
GPTQDLNTILLTSAQLSSDVAVETPKQEFDVFGAVPFFAVKAQQ 
PQQEKNEJOTLPQHRFPAAGLEQEEFDVFTKAPFSKKVNVQECHA 
VGPEAHTIPGYPKSVDVFGSTPFQPFLTSTSKSESNEDIiFGLYP 
FDEITGSQQQKVKQRSLQKXSSRQRRTKQDMSKSNGKRHHGTPT 
STKKTLKPTYRTPERARRHKKVGRRDSQSSNEFLTISDSKENIS 
VALTDGKDRGNVLQPEESLLDPFGAKPFHSPD\LSWHPP\HQGL 
S\DIRADHNT\VLPGR\PRQNSLHGSFHSADVLKMDDFGAVP/F 
LTELWQSITPHQSQC SQPV\ELDPFGAAPFPSKQ 

mswi ifiir .1 3:eteidhdvmteeklhqinnfwsdseyr 

LNKHGSVT.NAVLIMLAQHALLIAISSDLNAYGWCEFDWNDGNC 
QEGWP PMDGSEG I R I TP I DTSG I P 



DSNQQSVSGTAASTLLIISFKATIYYQGTGHVQQFYGVfSPYSQT" 
TPPIVQSYAQPSLQYIQGQQIFTAHPQGVVVQPAAAVTTIVAPG 
QPQPLQPSEMWTNNLLDLPPPSPPKPKTIVLPPNWKTARDPEG 
KIYYYHVITRQTQWDPPTWESPGDDASLEHEAEMDLGTPTYDEN 
PMK\ASKKPKTAEADTSSELAKKSKEVFRKEMSQFIVQCLNPYR 
KVG\RITTTEDFKHLARKLTHGVMNKELKY^KNPE\DLEC 
JNJUMVKHKTKEYIKKYMQKFGAVYKPKEDTEFRVTVGPGWEDGWS 
GKTDSRERKSCGPFCSTPVSTVLLMIHHPGEFNPADVN 



NRTWSTPSTWMGVALPPLCSTGPWPVTRQITARTTCGAVPAKCP 
PWC/DVHEPRCQPPDCHGHGTCVDGHCQCTGHFWRGPGCDELDC 
GPSNCSQHGLCTETGCRCDAGWTGSNCSEECPLGWHGPGCQRPC 
KCEHHCPCDPKTGNCSVSRVKQCLQPPEATLRAGELSFFTRTAW 
LALTLALAFLLLISTAANLSLLLSRAERNRRLHGDYAYHPLQEM 
NGEPLAAEKEQPGGAHNPFKD 



GRVGARSPGRAYALLLLLICFNVGSGLHLQVLSTRNENKLLPKH 
PIILVRQKRAWITAPVALLEGEDLSKKNPIAKIHSDLAEERGLKI 
TYKYTGKGITEPPFGIFVFWKDTGELNVTSILDREKTPFFLLTG 
YALDARGNNVEKPLELRIKVLDINDNEPVFTQDVFVGSVEELSA 
AHTLVMK1NATDADEPNTLNSKISYRIVSLEPAYPPVFYLNKDT 
GEIYTTSVTLDREEHSSYTLTVEARDGNGEVTDKPVKQAQVQIR 
I LDWDN I P WENKVLEGM VEENQVNVE VTRI KVFDADE I GS DN 
WLANFTFASGNEGGYFHIETDAQTNEGIVTLIKEVDYEEMKNLD 
FS VIVANKAAFHKS IRSKYKPTP IPI KVKVKNVKEGIHFKSSVI 
S I YVS ESMDRSSKGQII GNFQAFDEDTGL PAHARYVKLEERDNW 
ISVDSVTSEIKLAKLPDFESRYVQNGTYTVKIVAISEDYPRKTI 
TGTVLINVEDINDNCPTL1EPVQTICHDAEYVNVTAEDLDGHPN 
SGPFSFSVIDKPPGMAEKWKIARQESTSVLLQQSEKKLGRSEIQ 
FLISDNQGFSCPEKQVLTLTVCEVLHGS\GCREAQHDSYVGLGP 
AAIALMI LAFLLLLLVPLLLLMCHCGKGAKGFTP I PGTI EMLHP 
WNNEGAPPEDKWPSFLPVDQGGSLVGRNGVGGMAKEATMKGSS 
3 A3 I VKGQHEMSEMDGRWEEHRSLLSGRATQFTGATGAI \MTTE 
TTITARATGASRDVAGAQAAAVALNEEFLKHYFTDKAASYTEED 
ENHTAKDCLLVYSQEETESLNASIGCCSFIEGELDDRFLDDLGL 
KFKTLAEVCLGQKIDINKEIEQRQKPATETSMNTASHSLCEQTM 
WSENTYSSGSSFPVPKSLQEANAEKVTQEIVTERSVSSRQAQK 
VATPLPDPMASRNVIATETSYVTGSTMPPTTVILGPSQPQSLIV 
TERVYAPASTLVDQPYANEGTWVTERVIQPHGGGSNPLEGTQH 
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SEQ 
ID 
NO; 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

sequence 


Predicted end 

to first 
residue of 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LQDVPYVMVRERESFLAPSSGVQPTLAMPNIAVGQNVTVTERVL 
APASTLQSSYQIPTENSMTARNTTVSGAGVPGPLPDFGLEESGH 
SNSTITTSSTRVTKHSTVQHSYS 


S154 


3660 


2146 


KKKTKMKNTLQKTVNFGAWP KPT I SDKSHLLQMVSKLDLTDAKN 
SDTAHIKS IEITS ILNGLQASES SAEDSEQEDERGAQDMDNNGK 
EESKIDHLTNNRNDLISKEEQNSSSLLEENKVHADLVISKPVSK 
SPERLRKD1EVLSEDTDYEEDEVTKKRKDVKKDTTDKSSKP3IK 
RGKRRYCNTEECLKTGSPGKKEEKAKNKESLCMENSSNSSSDED 
EEETKAKMTPTKKYNGLEEKRKSLRTTGFYSGFSEVAEKRIKLL 
NNSDERLQNSRAKDRKDTOSSICGQWPKKTLKELFSDSDTEAAA 
SPPHPAPEEGVAEESLQTVAEEESCSPSVELEKPPPVNVDSKPI 
EEKTVEVNDRKAEFPSSGSNFSA* IPLPYLHLNRLHQSL*QKGS 
RQQSSVTVSEPLAPNQEEVRSIKSETDST1BVDSVAGELQDLQS 
ERE * LASRF* CQCELEQ* * SARTRTS * KSLYRSEKSERCSGRRK 
FIKKAEKKP * SNSGKQQKEGK 


6155 


869 


121 


HLLPELRGKSWITMkH L t HVAPYLV 
YT.KSHFNPCVGVLIKPSWVT.APAHCYLPNLKVMLGNFKSRVRDG 
TEQTINPIQIVRYWNYSHSAPQDDLMLIKLAKPAMLNPKVQALN 
P\PTTNVRPGTVCLLSGLDW3QENSGRHPDLRQNLEAPVMSDRE 
CQKTEQG KSHRNSLCVKFVKVFSR I FGEVAVATVI CKDKLQGIE 
VGHFMGGDVGIYTHVYKYVSWIENTAKDK 


6156 


5725 


3984 


r " 'T I^TKKHFSIILNIJ,GMLLKKDNaDTRKLLMTWALEVAV 
VMKKSETYAPLFCLPSFHKFCKGLLADTLVEDOTillCLQACSSLH 
ALSSSLPDDLLQRCVDVCRVQLVHRGTCIRQAFGKLLKSIPLGV 
FLSNNWHTEIQEISLALRSHMSKAPSWTFHPQDFSD/VISFILY 
GNSHRTGKDNWLERLF YS CQRLDKRDQS TI PRNLLKTDAVLWQW 
AIWEAAQFTVLSKLRTPLGRAQDTFQTIEGIIRSLAGHTLNPDQ 
DVSQWTTADNDEGHGNNQLRLVLLLQYLENLEKLMYNAYEGCAN 
ALTSPPKVIRTFLYTNRQTCQDWLTRIRLSIMRVGLLAGQPAVT 
VRHGFDLLTEMKTTSLSQGNELEVSIMMWEALCELHCPEAIQG 
IAVWSSSIVGKHLLWINSVAQQAEGRFEKASVEYQEHLCAMTGV 

PEVINYLGNKACECYISTADWAAVQEWQNAIHDLKKSTSSTSLN 
LKADFNYIKSLSSFESGKFVECTEQLELLPGENINLLAGGSKEK 
IDMKKLLRNM 


6157 


946 


329 


MANRGPS YGLSRE VQEKI EQKYDADLENKLVDWI I LQCAED IEH 
PPPGRAHFQKWLMDGTVLCKL INSLYP PGQEP I PKI SESZMAFK 
QMEQISQFLKAAETYGVRTTDIFQT\DLWEGKDMAAVQRTLriAL 
GSVAVTKDDGCYRGEPSKFHRKAQQNRRGFSEEQLRQGQNVIGL 
QMGSNKGASQAGMTGYGMPRQIM*DAASCP 


6153 


441 


1482 


LGSLIVLSLHCKVIFSSQSLERAMKEKAVDLVPILAQNPGLAQN 
P I LEG KDHNQNTGVD P I I DHVQDRKTD /SRSKSPHKKRSKSRER 
RKSRSRSHSRDKRKDTREKIKEKERVKEKDREKEREREKEREKE 
KERGKNKDRDKEREKDREKDKEKDREREREKEHEKDRDKEKEKE 
QDKEKEREKDRSKEIDEKRKKDKKSRTPPRSYNASRRSRSSSRE 
RRRRRSRSSSRSPRTSKTI KRKS SRS P S PRSRNKKDKKRE KERD 
HISERRERERSTSMRKSSNDRDGKEKLEKNSTSLKEKEHNKEPD 
SSVSKEVDDKDAPRT3ENKIQHMGNCQLNEENLSTKTEAV 


6159 


53 


84 


AVIAPLHISLGDRARPYLKNTEKSSTTCSRRRNQSFPPVMSLTH 
RLHLCKYWGCAVSNVCRFWEGRPLPLMIWPYTLPVSLPVGSCV 
IITGTPILTFVKDPQLEVNFTTGMDEDSDIAFQFRLHFGHPAIM 
NSCVFGIWRYEEKCYYLPFEDGKPFELCIYVRHKEYKVMWGQR 
IYNFAHRFPPASVKMLQVFRDISLTRVLISD*GRCVRITAVQEF 
DVSVSCDCTTAYQPG 


61S0 


1626 


1790 


AGAKFFP*F*KVADAQPTESEKEIYNQYHWLKDAEGILEDLQS 
YRGAGHEIREAIQHPADEKLQEKAWGAWPLVGKLKKFYEFSQR 
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SEQ 
WO: 


beginning 
nucleotide 

corresponding 
to first 
amino acid 

sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, B=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonir.e , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LEAALRGLLGALTSTPYSPTQHLEREQAIAKQFAEILHFTLRFD 
ELKMTNPAIQNDFSYYRRTLSEMRINm r PAEGENE\TOTELANRM 
SLFYAEATPMLKTLSDATTKFVSENKNLPIENTTDCLSTMASVC 
RVMLETPEYRSRFTNEETVSFCl.RVMVGVIILYDHVHPVGAFAK 
TSKIDMKGCIKVLKDQPPNSVEGLLNALRYTTKHLNDETTSKQI 
KSMLQ*QLLTLVNKG 


E161 


455 


1569 


PVSGSESSLRRAWAS ILRLMLGPRVAVS ILCED3ISH*LLEKH* 
KSHVLEPLSSLALEEQCLALSLDWSTGKTGRAGDQPLKIISSDS 
TGQLHLLMVNETRPRLQKVASWQAHQFEAWIAAFNYWHPEIVYS 
GGDDGLLRGWDTRVPGKFLFTSKRHTMGVCSIQSSPHREHILAT 
GSYDEHILLWDTRNMKQPIADTPVQGGWRIKWHPFHHHLLLAA 
CMHSGFKILMCQKAMEERQEATVLTSHTLPDSLVYGADWSWLLF 
RSLQRAPSWSFPSNLGTKTADLKGASELPTPCHBCREDNDGEGH 
ARPQSGMKPLTEGMRKNGTWLQATAATTRDCGVNPEEADSAFSL 
LATCSFYDHALHLWEWEGN 


6162 




586 


RTIHATGRAGASPMHRLJ.VWRLAEAI^KQHVP.CQRL'LEFUHWTYE 
> L E T AELKKALKE KENRLLLQQS IGETNVERKAK 
KKRSXSVTSSSSSSSDSSASDSSSESEETSTSSSSEDSDTDESS 
SSSSSSASSTTSSSSSDSDSDSSSSSKQ*HQHR*QL*R*TTKEE 
EKE IELLHS YWTDGLKTLM 


61S3 




785 


R I RS"TKGCAVRLHPTQMTGKAR IM I LLSVS LGRHWAFT YKFFL 
A". >'*'■'*• HRKE * VMQKNPMKSREDEWMEKLNNLHVQRAE 
MNRLI MNYLVTEGFKEAAEKFRMESG I EPSVDLETLDERI KI RE 
MILKGQIQEAIALINSIiHPELLETNRYLYFHLQQQHLIELIRQR 
ETEAALEFAQTQLAEQGEESRECLTEMERTLALLAFDSPEESPF 
GDLLHTMQRQKVWSEVNQAVLDYENRESTPKLAKLLKLLLWAQN 
ELDQKKVKYPKMTDLSKGVIEEPK 


6Z64 


90 




PCQSPGRSRMRQDKLTGSLRRGGRCLKRQGGGVGTILSNVLKKR 
SCISRTAPRLLCTLEPGVDTKLKFTLEP3LGQNGFQQWYDALKA 
VARLSTGIPKEWRRKVWLTLADHYLHSIAIDWDKTMRFTFNERS 
NPDDDSMGIQIVKDLHRTGCSSYCGQEAEQDRWLKRV1LAYAR 
WtTKTVGYCQGFNILAALILEVMEGNEGDALKIMIYLIDKVLPES 
YFVNNLRALSVDMAVFRDLLRMKLPELSQHLDTLQRTANKESGG 
GYEPPLTNVFTMQWFLTLFATCLPNQTVLKIWDSVFFEGSEIIL 
RVSLAIMAKLGEQIECCETADEFYSTMGRLTQEMLENDLLQSHE 
LMQTVYSMAPFPFPQLAELREKYTYNITPFPATVKPTSVSGRHS 
KARDSDEENDPDDEDAWKAVGCLGPFSGFLAPELQKYQKQIKE 
PNEEQ SLRSNNI AELS PGAINS CRSE YHAAFNSMMMERMTTDIN 
ALKRCYSRIKKKQQQQVHQVYIRADKGPVTSILPSQWSSPVIN 
HLLLGKKMKMTNRAAKWAVIHIPGHTGGKISPVPYEDLKTKLWS 
PWRTHIRVHKKNMPRTKSHPGCGDTVGLIDEQNEASKTNGLGAA 
EAFPSGCTATAGREGSSPEGSTRRTIEGQSPEPVFGDADVDVSA 
VQAKLGALELNQRDAAAETELRVHPPCQRHCPEPPSAPEENKAT 
SKAPQGSNSKTPIFSPFPSVKPLRKSATARNLGLYGPTERTPTV 
HF?QMSRSFSKPGGGNSGP*KMVFSSGTMLSRQLPGYPQEYQRN 


6165 


90 


406 


PCQSPGRSRMRQCKLTGSLHKGGRCLKRQGGGVGTILSNVLKKR 
SCISRTAPRLLCTLEPGVDTKLKFTLEPSLGQNGFQQWYDALKA 
VARLSTGIPKEMRRKVWLTLADHYLHSIAIDWDKTMRFTPNERS 
M? DDDSMG I Q IVKDLHRTGCSSYCGQEAEQDRWLKRVLIiAYAR 
KKKrVGYCQGFNILAALILEVMEGNEGDALKIMrYLIDKVLPES 
YFWINLRALSVDMAVFRDLLRMKLPELSQHLDTLQRTANKESGG 
GYEPPLTNVFTMQWFLTLFATCLPNQTVLKIWDSVFFEGSEIIL 
RVSLAIWAKLGEQIECCETADEFYSTMGRLTQEMLENDLLQSHE 
LMQTVY3MAPFPFPQLAELREKYTYNITPFPATVKPTSVSGRHS 
KARDSDEENDPDDEDAWNAVGCLGPFSGFLAPELQKYQKQIKE 
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SEQ 
ID 
NO: 


Predicted 
nucleotide 

to first 
amino acid 


Predicted end 

corresponding 

residue of 
amino acid 
sequence 


Amino acid segment containing- signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 

P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PNEEQSLRStmiAELSPGAINSCRSEYHAAFNSMMMERMTTDIN 
ALKRQYSRI KKKQQQQVHQVY IRADKGPVTS ILPSQVNSS PVIN 
HLLLGKKMKMTNRAAKNAVIHI PGHTGGKI S PVPYEDLKTKLNS 
PWRTHIRVHKKNMPRTKSHPGCGDTVGLIDEQNEASKTNGLGAA 
EAFPSGCTATAGREGSSPEGSTRRTIEGQSPEPVFGDADVDVSA 
VQAKLGALELNQRDAAAETELRVHPPCQRHCPEPPSAPEENKAT 
SKAPQGSNSKTPIFSPFPSVKPLRKSATARNLGLYGPTERTPTV 
HFPQMSRSFSKPGGGNSGP*KMVFSSGTMLSRQLPGYPQEYQRN 
GGERFG 


61S6 




1206 


HKLWRTVAMAGAEWKSLEECLEKHLPLPDLQEVKRVLYGKELRK 
LDLPREAFEAASREDFELQGYAFEAAEEQLRRPRIVHVGLVQNR 
I PL PANAPVAE QVS ALHRR I KAI VE VAAMCGVNI I CFQEAWTMP 
FAFCTREKLPWTEFAESAEDGPTTRFC3KLAK1JHDMVWS P ILE 
RDSEHGDVLWNTAVVISNSGAVLGKTRKNH1 ?RVG JFNESTYYM 
EGNLGHPVFQTQFGRIAVNICYGRHHPLNWLMYSINGAEIIFNP 
SATIGALSESLWPIEARNAAIANHCFTCAINRVGTEHFPNEFTS 
GDGKKAHQDFGYFYGSSYVAAPDSSRTPGLSRSRDGLLVAKLDL 
NLCQQVNDVWNFKMTGRYEMYARELAEAVKSNYSPTIVKE*PAS 
VPA1G 


61S7 


1220 


1844 


YGIVTGPSLCAGDKQPKKQEKNPVLVSPEFVDEALCACEEYLSN 
LAHMDIDKDLEAPLYLTPEGWSLFLQRYYQVVHEGAELRHLDTQ 
VQRCEDILQQLQAWPQIDMEGDRNIWIVKPGAKSRGRGIMCMD 
HLEEMLKLVNGNPWMKDGKWWQKYIERPLLIFGTKFDLRQWF 
LVTDWWPL LDK*APLYLTPEGW3 
LFLQRYYQWHEGAELRHLDTQVQRCEDILQQLQAWPQIDMEG 
DRNIWIVKPGAKSRGRGIMCMDHLEEMLKLVNGNPWMKDGKWV 
VQKYIERPLLIFGTKFDLRQWFLVTDWNPLTVWFYRDSYIRFST 
QPFSLKNLDK 








VWPVPSVSAMPPKKQAQAGGSKKAEQKKKEKIIEDKTFGLKNKK 
GAKQQKFIIUWTHQVKFGQQNPRQVAQSEAEKKLKKDDKKKELQ 
ELNELFKP WAAQ K I S KGAD P KSWCAFFKQGQ CTKGDKCKFSH 
DLTLERKCEKRSVYIDARDEELEKDTMDNWDEKKLEEWNKKHG 
EAEKKKPKTQIVCKHFLEAI ENNKYGWFWVCPGGGDICMYRHAL 
PPGFVLKKKKKKKKKEDE1SL*DLIERERSALCPNVTKITLESF 
LAWKKRKRQEKIDKLEQDMERRKADFKAGKALVISGREVFEFRP 
ELVNDDDEEADDTRYTQGTGGDE VDDSVS VND IDLS LYI PRDVD 
ETG I T VAS LERFS T YTSDKDENKLS EASGGRAEWGERS DLEEDW 
EREGTENGAIDAVPVDENLFTGEDLDELEEELNTLDLEE 








AASVFVLYATSCANNFAMKGKRKTLNASDVLSAMEEMEFQRFVT 
PLKEALEAYRREQKGKKEASEQKKKDKDKKTDSEEQDKSRDEDN 
DEDEERLEEEEQNEEEEVDN*KGRETVAPWKVPLEMRRATCFCE 
AFPCWAE 


6170 




567 


STKVMLPNTGRLAGCTVFITGASRGIGKAIALKAAKDGANIVIA 
AKTAQPHPKLLGTIYTAAEEIEAVGGKALPCIVDVRDEQQISAA 
VEKAIKKFGGIDILVNNASAISLTNTLDTPTKRLDLMMNVNTRG 
TYLAS KAC I PYLKKS KVAHI PNI SPPLNLNPVWFKQHCGRW* W 
G*GDGLCLICFELNLCMSDVITICT 


6171 


382 


941 


HFMQSDVELDCDIEPCGHTKFPPTLPLSTTVIVCSCHPVATAST 
MAEAFSKTTSEEDQSIQEPKEANSMTAQKQKK*GLRGSRRRHAN 
SGGDIFGDSFAAYFPRVLKQVHQALSLSQEAVSVMDSMVRDILD 
RIATEAGHLAHYSKCVTITSRDIRMAVCLLLPGKMGKLAESQGT 
NATLRYTKSK 


S172 


651 


54 


GLCRAGGAHRFSRTHVEAALKMLRREARLRREYLYRKAREEAQR 
SAQERKERLRRALEENRLIPTELRREALALQGSLEFDDAGGEGV 
TSHVDDEYRWAGVEDPKVMITTSRDPSSRLKMFAKELKLVFFGA 
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NO: 


Predicted 
beginning 

nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

corresponding 

amino acid 

residue of 

sequence 


(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 

\=possible nucleotide insertion) 








QRMNRGRHEVGALVRACKANGVTDLLWHEHRGTPVGijIVSHLP 
FGPTAYFTLCNWMRHDIPDLGTMSEAKPHLITHGFSSRLGKRV 
S D I LR YLFP VPKDDS HRV I T FANQDDY I S FRHHV YKKTDHRNVE 

LSTE*AAPRPLGQLL 


6173 


3 




SVDHREVQVIjSQSMPLTPHQAVLRGERPYMCVECGKCFGRSSHL 
LQHQRIHTGEKPYVCSVCGKAFSQSSVLSKHRTIHTGEKPYECN 
ECGKAFRVSSDLAQHHKIHTGEKPHECLECRKAFTQLSHLIQHQ 
RIHTGERPYVCPLCGKAFNHSTVLRSHQRVHTGEKPHRCNECGK 
TFSVKRTLLQHQRIHTGEKPYTCSECGKAFSDRSVLIQHHNVHT 
GEKPYECSECGKTFSHRSTLMNHERIHTEEKPYACYECGXAFVQ 
HSHLIQHQKVHRKL* PTCVLSVGSALAGVPTSFSISVSTLERSP 
MCAVYVGRPSARAQSLVNTGQFTQVRSPMSVMSVEKPLE 


S174 


10S0 


959 


PRPPGKRWMVAGLGNPGLPGTRHSVGMAVLGQLARRLGVAESKT 
RDRHCAADLALAPLGDAQLVLLRPRRLMNANGRSVARAAELFGL 
TAEEVYLVHDEtiDKPLGRLALKLGGSARGHNGVRSCISCLNSNA 
MPRLRVGIGRPAHPEAVQAHVLGCF3PAEQELLPLLLDRATDLI 
LDHIRERSQGPSLGP*H*WFSKKA 








RYFRADPRSRSGQPIIAEGLGAFAECFjI-'II.-." *' SIiRKQSTEG 
DALDFPASPKPAGKQNGIQNPISLEDSPEAGGEREEEQEREEEQ 
AFLVSLYKFMKERHTPIERVPHLGFKQINLWKIYKAVEKLGAYE 
LVTSRRLWKNVYNELGGS PGSTSGATCTRRHY * RLVLP YVRHLK 
GEDDKPLPTSKPRKQYKMAKENRGDDGATERPKKAKEERRMDQM 

PEAYKRLLSSFYCKGTHGIMSPLAKKKLLAQVSKVEALQCQEEG 
CRHGAEPQASPAVHLPESPQSPKGLTENSRHRLTPQEGLQAPGG 
SLREEAQAGPCPAAPIFKGCFYTHPTEVLKPVSQHPRDFFSRLK 

KPKACWVSPMAKVPAESPTLPPTFPSSPGLGSKRSLEEEGAAHS 
GKRLRAVSPFLKEADAKKCGAKPAGSGLVSCLLGPALGPVPPEA 
YRGTMLHCPLNFTGTPGPLKGQAALPFSPLVIPA7PAHFLATA3 
PSPMAAGLMHFPPTSFDSALRHRLCPASSAWHAPPVTTYAAPHF 
FHLNTKL 


S17S 






GVREGQTQVDTPQIGDMAYWSHPIDLHFATKGLQGWPRLHFQVW 
SQDSFGRCQLAGYGFCHVPSSPGTHQLACPTWRPLGSWREQLAR 
AFVGGGPQLLHGDTIYSGADRYRLHTAAGSTVHLEIGLLLRNFD 
RYGVEC*GTLPPTSPPSTPRTPSDGGGWHSGQEHRL 


6177 


1400 


992 


VPIESLVGKVHNFPLIAFYCCEKGKRQPHKSLHDRCFGEALDPN 
CSHCYLDQIKRSDFLGFSGYSPHFVAISTNSEHKMQPSSMQQAL 
PSQ*PYWTDPRPALVPCCSHRPDVHRSRPGPGLPGTSGCSDRP? 
VCPI 


5178 


1027 


254 


STQRGGI KGVARAAS LVGRRRAGTGMALLLCLVCLTAALAHGCL 
HCHSNFSKKFSFYRHHVNFKSWWVGDIPVSGALLTDWSDDTMKE 
LHLAI PAKI TREKLDQVATAVYQMMDQLYQGKM YFPGYFPNELR 
NIFREQVHLIQNAIIESR1DCQHRCGIFQYETISCNNCTDSHVA 
CFGYNCESSAQWKSAVQGLLNYINNWHKQDTSMRPRSSAFSWPG 
TKRAA?AFLVLPALRCLEPPHLANLSLEDAA*CLKQH 


S179 


80S 


276 


} -f,j -MAGNLLSGAGRRLWDWVPLACRSFSLGVPRLIGIRLTL 
PPPKWDRWNEKRAMFGVYDNIGILGNFEKHPKELIRGPIWLRG 
WKGNELQRCIRKRKMVGSRMFADDLHNLNKRIRYLYKHFNRHGK 
FR*KRKLRTSEKAHLSPWRRETVLFPVRKRLCIFSVIKWGFFGI 


6180 


1SS 




DHHILKAASTTHVCARGNlFAIPNTRCLEC*ATATPSSLECQN i 
SHLSLCPLPATTSGLTPNSMIPEKERQNIAERLLRVMCADLGAL 
SWSGKEFLKLAQTIiVDSGARYGAFSVTEILGNFNTLALKHLPR 
MYNQVK\ r KVTCALGSNACLGIGVTCHSQSVGPDSCYILTAYQAE | 
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" Predicted end 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalaniae, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=St 
Codon, /^possible nucleotide deletion, 
_ \^possible nucleotide insertion) 

GNHIKSYVLGVKGADIRDSGDLVHHWVQBVLSEFVMSEIRTVYV 
TDCRVSTSAFSKAGMCLRCSACALNSWQSVLSKRTLQARSMHE 
VIELLNVCEDLAGSTGLAKETFGSLEETSPPPCWNSVTDSLLLV 
HERYEQICEFYSRAKKMNLIQSLNKHLLSNLAAILTPVKQAVIE 
LSN3SQPTLQLVLPTYVRLEKLFTAKANDAGTVSKLCHLFLEAL 
KENFKVHPAHKVAMILDPQQKLRPVPPYQHEEIIGKVCELINEV 
KESWAEEADFEPAAKKPRSAAVENPAAQEDDRLC-KNEVYDYLQE 
PLFQATPDLFQYWSCVTQKHTKLAKLAFWLLAVPAVGARSGCVN 
MCEQALLIKRRRLLSPEDMNKLMFLKSNML 
TRTLLSPVLLPGPRWKPWRRRPMGPLALPAWLQPRYRKNAYLFI 
YYLIQFCGHSWIFTNMTVRFFSFGKDSMVDTFYAIGLVMRLCQS 
VSLLELLHIYVGIESNHLLPRFLQLTERIIILFWITSQEEVQE 
KYWCVLFVFWNLLDMVRYTYSMLSVIGISYAVLTWLSQTLWMP 
IYPLCVLAEAFAIYQSLPYFESFGTYSTKLPFDLSIYFPYVLKI 
YLMMLFIGM YFTYSHLYS ERRD ILGIFPI KKKKM* S TAFQCDTR 
KDRLWIQCSK*NTGSILVEKFLVF 

1 rfi PKLRi LI EDM TTeij: ~ 

VRPFGSSVNTFGKLGCDLDMFLDLDETRNLSAHKISGNFLMEFQ 
VKNVPSERIATQKILSVLGECLDHFGPGCVGVQKILNARCPLVR 
FSHQASGFQCDLTTNNRIALTSSELLYIYGALDSRVRALVFSVR 
CWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTLDSL 
KTLADAEDKCVIEGNHCTFVRDLSRIKPSQNTETLELLLKEFFE 
Y?GNFAFDKNSINIRQGREQNKPDS3PLYIQNPFETSLNISKNV 
SQSQLQKFVDLARESAWILQQEDTDRPSISSNRPWGLVSLLLPS 
APNRKSFTKKKSNKFAIETVKNLLESLKGNRTENFTKTSGKRTI 

STQT 

HLDRYIKSPGSGSSTPAPPSHLLLYLLHPQSTRTMGCCGCSRGC 
GSGCGGCGSSCGGCGSGCGGCGSGRGGCGSGCGGCSSSCGGCGS 
RCYVPVCCCKPVCSWVPACSCTSCGSC3GSKGGCGSCGGSKGGC 
GSCGCSQSSCCKPCCCSSGCGSSCCQSSCCKPCCCQSSCCVPVC 
COSSCCKPCCCQSNCCVPVCCQCKI*GSGPRPSGFSCLVKAFLM 
VP 

IVTVREEDGAPAVAPPGWVSRANKRSGAGPGGSGGGGARGAEE 
EPPPPLQAVLVADSFDRRFFPISKDQPRVLLPLANVALIDYTLE 
FLTATGVQETFVFCCWKAAQIKEHLLKSKWCRPTSLNVVRII' 
ELYRSLGDVLRDVDAKALVRSDFLLVYGDVISNINITRALEEHR 
LRRKL* KNVSVMTMIFKESS PSHPTRCHEDNWVAVDSTTNRVL 
HFQ KTQGLRRFAF PLSLFQGS SDGVE VRYDLLD CHISICS PQVA 
QLFTDNFDYQTRDDFVRGLLVNEEILGNQIHMHVTAKEYGARVS 
NLHMYSAVCADVIRRWVYPLTPEANFTDSTTQSCTHSRHNIYRG 
PEVSLGHGSILEENVLLGSGTVIGSNCFITNSVIGPGCHIEPGD 
NWLDQTYLWQGVRVAAGAQIHQSLLCDNAEVKERVTLKPRSVL 
TSQWVGPNITLPEGSVISLHPPDAEEDEDDGEFSDDSGADQEK 
DKVKMKGYNPAEVGAAGKGYLWKAAGMNMEEEEELQQNLWGLKI 
NMEEESESESEQSMDSEEPDSRGGSPQMDDIKVFQN3VLGTLQR 
GKEENISCDNLVLEIMSLKYAYNISLKEVMQVLSHWLEFPLQQ 
MDSPLDSSR YCALLLPLLKAWS PVFRNYI KRAADHLEALAAI ED 
FFLEHEALGISMAKVLMAFYQLEILAEETILSWFSQRDTTDKGQ 

QLRKNQQLQRF IQWLKEAEEESSEDD 

PCTSCVLWATLHLPASTRKAPQAECGMISITEKQKIGVGITGFG 
IFFILFGTLLYFDSVLLAFGNLLFLTGLSLIIGLRKTFWFFFQR 
HKLKGTSFLLGGWIVLLRWPLLGMFLETYGFFSLFKGFFPVAF 
GFLGNVCNIPFLGALFRRLQGTSSMV*KTEMSSLNLDHWLKGAK 
REEWEPPPQSPALTHSPTYPGPPQVQKERNGAEQLTSNPQVDSR 

GCQEAEMQTPRRLGWGWYHTLTLYLWEEK 

VYGIDSSI TI 1 I "LKKHWKLCHAQSRLDVNGLALKMA 
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SEQ 
MO: 


Predicted 
beginning 

corresponding 
to first 


Predicted end 
nucleotide 

corre spending 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H^-Histidine, I = Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








KERKVKNKVKNKADTEEVFNNSPTNQEKMPTSAILPDFSGSVIS 
NIRNQMETLHSQPHQEENLCFENSFSLINLLPINAVEPTSSQQI 
PMRETSEANKERRKMTSKSSESWIYSPLTSFITADSELHDIIKD 
LEDCLMVGLHTCGm .APNTLR I FTSNS E I KGVCSVGCCYHLLSE 
EFENQHKERTQEKWGFPMCKYLKEERWCCGRNARMSACLALERV 
AAGQSLPTESLFYRAVLQDIIKDCYGITKCDRHVGKIY8KCSSF 
LDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNELEAFKMLK 
WLAPCIETwILLDRLCYLKEQEDIAWSALVKLFDPVKSPRCYA 
VIALKKQQ*?PLKQIIRCISL*DSA3CAEEVSVGDGGPALRDAP 
PSGSRVGSRYD 


6187 


1701 


771 


DAWGPETRLARILNPDSFIEPRPGRLPELEATRPHMEPXASCPA 
AAPL.MERKFHVLVGVTG3VAALKLPLLVSKLLDIPGLEVAWTT 
ERAKHFYSPQDIPVTLYSDADEWEMWKERSDPVLHIDLRRWADL 
LLVAPLDANTLGKVASGICDNLLTCVMRAWDRSKPLLFCPAMNT 
AMWEHP I TAQQVDQLKAFGYVE I P CVAKKLVCGDEGLGAMAEVG 
TIVDKVKEVLFQHSGFQQS*PGISVMGVPLYSEWVQAKSVKMDV 
GKIGGYPHLLNGGPALSLPRGOACSRLNWTEGPGLSFFQPGEAA 
A 


5188 


238 


1534 


1 - 1 GPLMAET.C r <_ Ji-APEMSQICLSCGHPSA*GPRWASW 
NIGVFICIRCAGIHRNLGVHISRVKSVNLDQWTQEQIQCMQEMG 
NGKANRLYEAYLPETFRRPQIDPAVEGFIRDKYHKKKYMDRSLD 
INAFRKEKDDKWKRGSEPVPEKKLEPWFEKVKMPQKKEDPQLP 
RKSSPKSTAPVMDLLGLDAPVACSIANSKTSNTLEKDLDLLASV 
PSPSSSGSRKWGSMPTAGSAGSVPENLNLFPEPGSKSEEIGKK 
QLSKDSILSLYGSQTPQMPTQAMFMAPAQMAYPTAYPSFPGVTP 
PNSIMGSMMPPPVGMVAQPGASGMVAPMAMPAGYMGGMQASMMG 
VPNGMMTTQQAGYMAGMAAMPQTVYGVQPAQQLQWNLTQMTQQM 
AGMWFYGANGMMNYGQSMSG3NEQAANQTLS PQMWK 


6189 


1297 


793 


LGEPLGDLCELIPGDVQQLQMGEVHPGTGAQGSAAQSVAGEVQL 
TQLSHARQRPSCQGSQLIALDLQHMDISRQPRWQHVQPVARQVQ 
RAQQAQLAEGVAVHLWAGDAWAEVELLQEVGGGKVFAANACDL 
WQDHEGAHAARQATGHALQRVIVQVRRVQPLEAL*RVPSGLPR 
RVRAFMILHNQITGIGREDFATTYFLEELNLSYNRITSPQVHRD 
AFRKLRLLRSLDLSGNRLHMLPPGLPRNVHVLKVKRNELAALAR 
GALAGMAQLRELYLTSNRLRS RALGPRAWVDLAHLQLLD IAGNQ 
LTEIPEGLPESLEYLYLQMNKISAVPANAFDSTPNLKGIFLRFN 
KLAVGSWDSAFRRLKHLQVLDIEGNLEFGDISKDRGRLGK3KE 
EEEEDEVEEEETR 


S190 


66 


1309 


ILVGHVSFLLSFAEYVCITCSWGSLNVNRCNQTTGQCECRPGYQ 
GLHCETCKEGFYLNYTSGLCQPCDCSPHGALSIPCNSSGKCQCK 
VGVIGSICDRCQDGYYGFSKNGCLPCQCNNRSASCDALTGACLN 
CQENSKGNHCEECKEGFYQSPDATKECLRCPCSAVTSTGSCSIK 
SSELEPECDQCKDGYIGPNCNKCENGYYNFDSICRKCQCHGHVY 
PVKTPKICKPESGECINCLHNTTGFWCENCL*GYVHDLEGNCIK 
KVILPTPEGSTILVSNASLTTSVPTPVINSTFTPTTLQTIFSVS 
TSENSTSALADVSWTQFNI I ILTVI 1 1 WVLLMGFVGAVYMYRE 
YQKRKLNAPFWTIELKEDKISFSSYHDSIPNADVSGLLEDDGNE 
VAPNGQLTLTTPIHNYKA 


6191 




1511 


VNLCHGGLLHLSTHHLGIKPSMH*LFFLMLSFPHLTPQQPKCPS 
MIDWIKKIWYIYTMEYYATIKRNEIMFFAGTWMEMEAIILSKLM 
QDYMFSL1SGS 


6192 




950 


n iGCGH KI4AGKKNVLSSLAVYAEDSEPESDGEAG I EAVGSAAEE 
KGGLVSDAYGEDDFSRLGGDEDGYEEEEDENSRQSEDDDSETEK 
PEADDPKDNTEAEKRDPQELVASFSERVRNMSPDEIKIPPEPPG 
RCSNHLQDKIQKLYERKIKEGMDMNYIIQRKKEFRNPSIYEKLI 
QPCAIDELGTNYPKDMFDPHGWSEDSYYEALAECAQKIEMDKuEK 
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SEQ 
ID 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

amino acid 

amino acid 
sequence 


Atn_no acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=HiGtidine, I-Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








AKKERTKIEFVTGTKKGTTTNATSTTTTTASTAVADAQKRKSKW 
DSAIPVTTIAQPTILTTTATLPAVVTVTTSASGSKTTVISAVGT 
IVKKAKQ 


61S3 


3 


950 


TRGCGNXMAGKKNVLSSLAVYAEDSEPESDGEAGIEAVGSAAEE 


PEADDPKDNTEAEKRDPQELVASFSERVRNMSPDEIKIPPEPPG 
RCSNHLQDKI QKL YBRX I KEGMDMN Y I IQRKKE FRNPS I YEKL I 
QFCAIDELGTNYPKDMFDPH3WSEDSYYEALAKAQKIEMDKLEK 
AKKERTKIEFVTGTKKGTTTNATSTTTTTASTAVADAQKRK3KW 
DSAIPVTTIAQPTILTT7ATLPAWTVTTSASGSKTTVISAVGT 
IVKKAKQ 


6194 


■ 


95 0 


TRGCGNKMAGKKNVLSSLAVYAEDSEPESDGEAGIEAVGSAAEE 
KGGLVSDAYGEDDFSRLGGDEDGYEEEEDEKSRQSEDDDSETEK 
PEADDPKDNTEAEKRDPQELVASFSERVRNMSPDEIKIPPEPPG 
RCSNHLQDKI QKL YERKI KEGMDMNYI I QRKKEFRNPS IYEKLI 
QFCAIDELGTNYPKDMFDPHGWSEDSYYEALAKAQKIEMDKLEK 
AKKERTKIEFVTGTKKGTTTNATSTTTTTASTAVADAQKRKSKW 
DSAIPVTTIAQPTILTTTATLPAVVTVTrSASGSKTTVISAVGT 
IVKKAKQ 


6195 


73S 


235 


VANGLQSNMPKFYCDYCDTYLTHDSPSVRKTIIC3GRKHKFNVKD 
YYQKWMEEQAQS L I DKTTAAFQQGKI P PTPFSAPP PAGAM I PPP 
PSLPGPPRPGMMPAPHMGGPPMMPMMGPPPPGMMPVGPAPGMRP 
PMGGHMPMMPGPPMMRPPARPMMVPTRPGMTRPDR 




1512 


623 


KTGi:P.R3AAYVRHILDNAEQVISNLEARNLGPRLTPLLCEEDSH 
QRLLMGLMVSELKDHFLRHLQGVEKKKIEQMVLDYI3KLLDLIC 
HIVETNWRKHNLHSWVLHFNSRGSAAEFAVFHIMTRILEATNSL 
FLPLPPGFHTLHTILGVQCLPLHNLLHCIDSGVLLLTETAVIRL 
MKDLDNTEKNEKLKFSIIVRLPPLIGQKICRLWDHPMSSNIISR 
NHVTRLLQNYKKQPRNSKINKSSFSVEFLPLNYFIEILTDIESS 
NQALYPFEGHDNVDAEFVEEAALKHTAMLLGL 


6197 


3 




ADPEGTEEAVMSRYTRPFNTSLFIRNVADATRPEDLRREFGRYG 
PIVDVYIPLDFYTRRPRGFAYVQFEDVRDAEDALYNLNRKWVCG 
RQIEIQFAQGDRKTPGQMKSKERHPCSPSDHRRSRSPSQRRTRS 
RSSSWGRNRRRSDSLKESRHRRF3YSQSKSRSKSLPRRSTSARQ 
SRTPRRNFGSRGRSRSKSLQKRSKSIGKSQS3SPQKQTSSGTKS 
RSHGRHSDSIARSPCKSPKGYTNFETKVQTAKHSHFRSHSRSRS 
YRHKNSW 


6193 


111 


1512 


S3AALSPSFISPACFLLRKLPAL3DGTLPHPDTLGMNYEGARSE 
RENHAADDSEGGALDMCCSERT.PGLPQPIVMEALDEAEGLQDSQ 
REMPPPPPPSPPSDPAQKPPPRGAGSHSLTVRSSLCLFAASQFL 
LACGVLWFSGYGHIWSQNATNLVSSLLTLLKQLEPTAWLDSGTW 
GVPSLLLVFLSGGLVLVTTLVWHLLRTPPEPPTPLPPEDRRQSV 
SRQPSFTYSEWMEEKIEDDFLDLDPVPETPVFDCVMDIKPEADP 
TSLTVKSMGLQERRGSNVSLTLDMCTPGCNEEGFGYLMSPREES 
AREYLLSASRVLQAEELHEKALDPFLLQAEFFEIPMNFVDPKEY 
DIPGLVRKNRYKTILPNPHSRVCLTSPDPDDPLSSYINANYIRG 
'-'GGi:EI'.VYIATQGPIVSTVADFWRMVWQEHTPIIVMITNIEEMN 
EKCTEYWPEEQVAYDGVEITVQKVIHTEDYRLRLISLKSGTEER 
GLKHYWFTSWPDQKTPDRAPPLLHLVREVEEAAQQEGPHCAPII 
VHCSAGIGRTGCFIATSICCQQLRQEGWDILKTTCQLRQDRGG 
MIQHCEQYQFVHHVMSLYEKQLSHQSPE 


61S9 


144 


1211 


u T E iJEESSSWKKQAEDIKKIFEFKETLGTGAFSEVVLAEEKA 
TC-r.L,FAVKCIPKKALKGKESSIENEIAVLRKIKHENIVALEDIY 
ESPNHLYLVMQLVSGGELFDRIVEKGFYTEKDASTLIRQVLDAV 
YYLHRMGIVHRDLKPENLLYYSQDEESKIMISDFGLSKMEGKGD 
VMSTACGTPGYVAPEVLAQKPYSKAVDCWSIGVIAYILLCGYPP 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H^Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginir.e, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=pcssible nucleotide insertion) 
Ty['™dsXLFEQILKAEYEFDSPYWDDISDSAKDFIRNLMEKDP 
NKRYTCEQAARHPWIAGDTALNXNIHESVSAQIRKNFAKSKWRQ 
AFNATAVVRHMRKLHLGSSLDSSNASVSSSLSLASQKDCASGTF 
HAL* 

LPEVPHSLRPRVKPHLCCAQPAVRVMARLPKLAVFDLDYTLWPF 
WVDTHVDPPFHKSSDGTVRDRRGQDVRLYPEVPEVLKRLQSLGV 
PGAAASRTSEIEGANQLLELFDLFRYFVHREIYPGSKITHFERL 
QQKTGIPFSQMIFFDDERRNIVDVSKLGVTCIHIQNGMNLQTLS 
QGLETFAKAQTGPLRS S LEES P FEA 

GQTPRVRWKMRRSLRAGKRRQTAGRKSKSPPKVPIVIQDDSLPA 
GPPPQIRILXRPTSNGWSEPNSTSRPTLPVKSLAQREAEYAEA 
RKRILGSASPEEEQEKPILDRPTRISQPEDSRQPNNVIRQPLGP 



INADRAAVASSLLSRPT^KKAPOKDRKPKRSTWRFNLDLTHPVE 
DGIFDSGNFEQFLREKVKVNGK7GNLGKWHIERFKNKITWSE 
KQFSKRYLKYLTKKYLKKNNLRDWLRWASDKETYELRYFQISQ 



RCPRPPATAGAAASRPDRSPPSGISGSEAAAGAGAAAPASQHPA 
TGTGAVQTEAMKQILGVIDKKLRNLEKKKGKLDDYQERMNKGER 
LNQDQLDAVSKYQEVTMvJLEFAKELQRSFMALSQDIQKTIKKTA 
RREQLMREEAEQKRLKTVLELQYVLDKLGDDEVRTDLKQGLNGV 
PILS3EELSLLDEFYKLVDPERDMSLRLNE0YEHASIHLWDLLE 
GKEKPVCGTTYKVLKEIVERVFQSNYFDSTHNHQNGLCEEEEAA 
SAPAVEDQVPEAEPEPAEEYTEQSEVESTEYVNRQFMAETQFTS 
GEKEQVDEWTVETVEWNSLQQQPQAASPSVPEPHSLTPVAQAD 
PLVRRQRVQDLMAQMQGPYNFIQDSMLDFENQTLDPAIVSAQPM 
NPTQNMDMPQLVCPPVHSESRLAQPNQVPVQPEATQVPLVSSTS 
EGYTASQPLYQPSHATEQRPOKEPIDQIQATISLNTDQTTASSS 
LPAASQPQVFQAGTSKPLHSSGI^^VKAAPFQSMQTVF^JMNAPVP 
PVNEPETLKOQNQYQASYNQSFSSQPHQVEQTELQQEQLQTWG 
TYHGSPDQSHQVTGNHQQPPQQNTGFPRSNQPYYNSRGVSRGGS 
RGARGLMNGYRGPANGFRGGYDGYRPSFSNTPNSGYTQSQFSAP 
RDYSGYQRDGYQQNFKRGSGQSGPRGAPRGRGGPPRPNRGMPQM 
NTQQVN 

CTHHLISLLGGRALIHFNRFLNLKIQEGEAHNIFCPAYDCFQLV 
PGDII KS WS KEMDKRYLQFD IKAFVENNPAI KWCPTPGCDRAV 
RLTKQGSNTSGSDTLSFPLLRAPAVDCGKGHLFCWECLGEAHEP 
CDCQTWKNWLQKITEMKPEELVGVSEAY3DAANCLWLLTNSKPC 
ANCKSPIQKNEGCNHMQCAKCKYDFCWICLEEWKKHSFVHWEVI 
YRCTRYEVIQHVEEQSKEMTVEAEKKHKRFQELDRFMHYYTRFK 
NHEHSYQLEQRLLKTAKEKMEQLSRALKETEGGCPDTTFIEDAV 
HVLLKTRRILKCSYPYGFFLEPKSTKKEIFELMQTDLEMVTEDL 
AQKVNRPYLRTPRHKIIKAACLVQQKRQEFLASVARGVAPADSP 
3APRP.SFAGGTWDWEYLGFASPEEYAEFQYRRRERQRRRGDVHS 
LLSNPPDPDEPSESTLDIPEGGSSSRRPGTSWSSASMSVLHSS 
SLRDY7PASRSENQDSLQALSSLDEDDPNILLAIQLSLQESGLA 
LDEETRDFLSNEASLGAIGTSLP3RLDSVPRNTDSPRAALSSSE 
LLELGDSLMRLGAENDPFSTDTLSSHPLSEARSDFCPSSSDPDS 
AGQDPNINDNLLGNIMAWFHDMNPQSIALIPPATTEISADSQLP 
CIKDGSEGVKDVELVLPEDSMFEDASVSEGRGTQIEENPLEENI 
PGGGKQHPQAW 

RAHl GKMAjEVGDMEDGQLSDSDSDMTVAPSDRPLQLPKVLGGD 
SAMRAFQNTATACAPVSHYRAVESVDSSEESFSDSDDDSCLWKR 
KRQXCFNPPPKPEPFQFGQSSQKPPVAGGKKINNIWGAVLQEQN 
QDAVATELGILGMEGTIDRSRQSETYNYLLAKKLRKESQEHTKD 
LDXELDEYMHGGKKMGSKEEBNGQGHLKRKRPVKDRLGNRPEMM 
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YK3RYEITAEDSQEKVADEISFRLQEPKKDLIARWRIIGNKKA 

QI KD I FY I ENQKE YENKKAARKRRTQVLGKKMKQAI KSLNFQED 
DDTSRETFASDTNEALASLDESQEGHAEAKLEAEEAIEVDHSHD 
LDIF 


6206 


10 


1442 


IISERRERSCLHLVCIRCSCDWEMGSVLGLCSMASWIPCLCGS 
APCLLCRCCPSGNNSTVTRLIYALFLLVGVCVACVMLIPGMEEQ 
LNKIPGFCENEKGWPCNILVGYKAVYRLCFGLAMFYLLLSLLM 
I KVKS S SDPRAAVHNGFWFFKFAAAI AI I IGAFF I PEGTFTTVW 
FYVGMAGAFCFILIQLVLLIDFAHSWNESWVEKMEEGNSRCWYA 
ALLSATALNYLLSLVAIVLFFVYYTHPASCSENKAFISVNMLLC 
VGASVMSILPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPE 
TNCNPS LLS I IGYNTTSTVPKEGQSVQWWHAQG I IGL I LFLLCV 
FYSSIRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 
RAVDNERDGVTYSYSFFHFMLFLASLYIMMTLTNWYRYEPSREM 
KS QWTAVWVKI SSSWIGI VL Y VWTLVAP LVLTNRD FD 


«, 


2324 




-VI4AEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAG 
GGGG3SDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSPY 
SWCKYFQRGYCIYGDRCRYEHSKPLKQEEATATELTTKSSLAA 
SSST.SSIVGPLVEMNTGEAESRNSNFATVGAGSEDWVNAIEFVP 
GQPYCX3RTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAA 

. JE -xOENCVYLHGDSCDMCCLQVLHPMDAAQRSQHIKSCIEA 
H3KDMELS FAVORS KDMVCGICMEWYEKANPSERRFGILSNCW 

KEEKQKLILKYKEAMSNKACRYFDEGRGSCPFGGNCFYSHAYPD 
GEEESPQRQKTCTSSEYEAQRRNHFWELIEERENSNPFDNDEEE 
W7F3LGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLDL 


6208 


2924 


1171 


TVMAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAG 
GGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSPY 
SWCXYFQRGYCIYGDRCRYEHSKPLKQEEATATELTTKSSLAA 
SSSLSSIVGPLVEMNTGEAESRNSNFATVGAGSEDWVNAIEFVP 

VGECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQHIKSCIEA 
HEKDMELSFAVQRSKDMVCGICMEWYEKANPSERRFGILSWCN 
HTYCLKCIRKWRSAKQFESKIIKSCPECRITSNFVIPSEYWVEE 
KEEKQKLILKYKEAMSNKACRYFDEGRGSCPFGGNCFYKHAYPD 
GRREEPQRQKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEE 
WTFELGEMLLMLLAAGGDDELTDSEDEWDLFHDELEDFYDLDL 


6209 


1758 


829 


ERLCFPCMQSKIYSYMSPNKCSGMRFPLQEENSVTHHEVKCQGK 
PLAG I YRKREEKRNAGNAVRSAMKSEEQKI KDARKGPLVP FPNQ 
KSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGK 
TQQNRKLTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGM 
KIDLIDGKGRGVIATKQFSRGDFWEYHGDLIEITDAKKREALY 
AQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQ 
TKLHDI DGVPHL I LIASRD I AAGEELLYDYGDRSKAS IEAHPWL 
KH 


6210 


3761 


387 


IFGMSKjRMVLLEDSGSADFRRHFVNLSPFTITWLLLSACFVT 
SSLGGTDKELRLVDGENKCSGRVEVKVQEEWGTVCNNGWSMEAV 
SVICNQLGCPTAIKAPGWANSSAGSGRIWMDHVSCRGNESALWD 
JKilLJ'.iGKKSNCTHQQDAGVTCSDGSNLEMRLTRGGNMCSGRIE 
IKFQ - ; ?>WGrVCDDNFNIDHASVICRQLECGSAVSFSGSSNFGEG 
S3P I WFDDLICNGNE SALWKCKHQGWGKHNCDHAEDAGVI CS KG 
ADLSLRLVDGVTECSGRLEVRFQGEWGTICDDGWDSYDAAVACK 
QLGCPTAVTAIGRVNASKGFGHIWLDSVSCQGHEPAVWQCKHHE 
WGKHYCKHNEDAGVTCSDGSDLELRLRGGGSRCAGTVEVEIQRL 
LGKVCDRGW3LKEADWCRQLGCGSALKTSYQVYSKIQATNTWL | 
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JLSSCNGNETSLWDCKNWQWGGLTCDHYEEAKITCSAHREPRLV 
GGDIPCSGRVEVKHGDTWGSICDSDFSLEAASVLCRELQCGTVV 
SILGGAHFGEGNGQIWAEEFQCEGHESHLSLCPVAPRPEGTCSH 
SRDVGWCSRYTEIRLVNGKTPCEGRVELKTLGAWGSLCNSHWD 
IEDAHVLCQQLKCGVALSTPGGARFGKGNGQIWHHMFHCTGTEQ 
HMGDCPVTALGASLCPSEQVASVICSGNQSQTLSSCNSSSLGPT 
RPTIPEESAVACIESGQLRLVNGGGRCAGRVEIYHEGSWGTICD 
DSWDLSDAHWCRQLGCGEAINATGSAHFGEGTGPIWLDEKKCN 
GKESRIWQCHSHGWGQQNCRHKEDAGVICSEFMSLRLTSEASRE 
ACAGRLEVFYNGAWGTVGKSSMSETTVGWCRQLGCADKGKINP 
ASLDKAMSIPMWVDNVQCPKGPDTLHQCPSSPWEKRLASPSEET 
WITCDNKIRLQEGPTSCSGRVEIWHGGSWGTVCDDSWDLDDAQV 
VCQQLGCGPALKAFKEAEFGQGTGPIWLNEVKCKGNESSLWDCP 
ARRWGHSECGHKEDAAVNCTDISVQKTPQKATTGRSSRQSSFIA 
VGILGWLIAIFVALFFLTKKRRQRQRLAVSSRGENLVHQIQYR 
EMNSCLNADDLDLMNSSGGHSEPH 


5211 


3761 


387 


IFGMSKLF'.rV EDSGSA FRRH FVNT iS P FTI T WLLLS ACFVT 
SSLG3TDKELRLVDGENKCSGRVEVKVQEEWGTVCWNGWSMEAV 
SVICNQLGCPTAIKAPGWANSSAGSGRIWMDHVSCRGNESALWD 
CKHDGWGKHSNCrHQQDAGVTCSDGSNLEMRLTRGGNMCSGRIE 
IKFQGRWGTVCDDNFNIDHASVICRQLECGSAVSFSGSSNF3EG 
S G P I WFDDL I CNGNESALWNCKHQGWGKHNCDHAEDAGVI CS KG 
ADLSLRLVDGVTECSGRLEVRFQGENGTICDDGWDSYDAAVACK 
QLGCPTAVTAI GRVNAS KGFGH IWLDS VS CQGHE PAVWQCKHHE 
WGKHYCNHNEDAGVTCSDGSDLELRLRGGGSRCAGTVEVEIQRL 
LGKVCDRGWGLKEADWCRQLGCGSALKTSYQVYSKIQATNTWL 
FLSSCNGNETSLWDCKNWQWGGLTCDHYEEAKITCSAHREPRLV 
GGD I P CSGRVE VKHGDTWGS I CDS DFS LEAAS VL CRELQ CGT W 
SILGGAHFGEGNGQIWAEEFQCEGHESHLSLCPVAPRPEGTCSH 
SRDVGWCSRYTEIRLVWGKTPCEGRVELKTLGAWGSLCNSHWD 
IEDAHVLCQQLKCGVALSTPGGARFGKGNGQIWRHMFHCTGTEQ 
HMGDCPVTALGASLCPSEQVASVICSGNQSQTLSSCNSSSLGPT 
RPTIPEESAVACIESGQLRLVNGGGRCAGRVEIYHEGSWGTICD 
DSWDLSDAHWCRQLGCGEAINATGSAHFGEGTGPIWLDEMXCN 
GKESRIWQCHSHGWGQQNCRHKEDAGVICSEFMSLRLTSEASRE 
ACAGRLEVFYNGAWGTVGKSSMSETTVGWCRQLGCADKGKINP 

WITCDNKIRLQEGPTSCSGRVEIWHGGSWGTVCDDSWDLDDAQV 
VCQQLGCGPALKAFKEAEFGQGTGPIWLNEVKCKGNESSLWDCP 
ARRWGHSECGHKEDAAVMCTD1SVQKTPQKATTGRSSRQSSFIA 
VGILGVVLLAIFVALFFLTKKRRQRQRLAVSSRGENLVHQ1QYR 
EMNSCLNADDLDLMNSSGGHSEPH 


5212 


1 


1134 


LK'flELRPGGAVWGTGRGAGTGAPRSCCCQTNPGPPSSLRRAFRR 
RELPFPACHEIGLGAEAGSGPPPAPAARESRSRAMEEEASSPGL 
GCSKPHLEKLTLGITRILESSPGVTEVTIIEKPPAERHMISSWE 
OKNNCVMPEDVKNFYLMTNGFHMTWSVKLDEHI IPLGSMAINS I 
SKLTQLTQSSMYSLPNAPTLADLEDDTHEASDDQPEKPHFDSRS 
VIFELDSCNGSGKVCLVYKSGKPALAEDTEIWFLDRALYWHFLT 
DTFTAY YRLL I THLGL PQWQYAFTS YG I S PQAKQRVSMYKP I T Y 
NTNLLTEETDSFVNKLDPSKVFKSKNKIVIPKKKGPVQPAGGQK 
GPSGPSGPSTSSTSKSSSGSGNPTRK 


6213 




1134 


LKWELRPGGAVWGTGRGAGTGAPRSCCCQTNPGPPSSLRRAFRR 
RELPFPACHEIGLGAEAGSGPPPAPAARESRSRAMEEEASSPGL 
GCSKPHLEKLTLGITRILESSPGVTEVTIIEKPPAERHMISSWE 
QKNNCVMPEDVKNFYLMTNGFHMTWSVKLDEHIIPLGSMAINSI 
SKLTQLTQSSMYSLPNAPTLADLEDDTHEASDDQPEKPHFDSRS 
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Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X=Dnknown, *=Stop 
Codon, /=possible nucleotide deletion, 
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VIFHIDSCNGSGKVCLVYKSGKPALAEDTEIWFLDRALYWHFLT 
DTFTAYYRLLITHLGLPQWQYAFTSYGISPQAKQRVSMYKPITY 
NTNLLTEETDSFVNKLDPSKVFKSKNKIVIPKKKGPVQPAGSQK 
GP£ -TEGPSTSSTSKSSSGSGNPTRK 


6214 




460 


KEIAPSAIRPAARLGLGPARWQSRAAAFYFVRGFRTGWSFVGWV 
VLGTSAKRTRLFFFLSKMAASSRAQVLALYRAMLRESKRFSAYN 
YRTYAVRRIRDAFRENKNVKDPVEIQTLVNKAKRDLGVIRRQVH 
IGQLYSTDKLIIENRDMPRT 


6 215 


2 


1B49 


FVAGGPRGSGSAAETMPEIRVTPLGAGQDVGRSCILVSIAGKNV 
MLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDK 
CGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAWKKGE 
ANFFTSQMIKDCMKKWAVHLHCTVQVDDELEIKAYYAGHVLGA 
AMFQIKVGSESWYTGDYNMTPBRHLGAAWIDKCRPNLLITEST 
YATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 
ILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKT 
FVQRNMFEFKHIXAFDRAFADNPGPMWFATPGMLHAGQSLQIF 
RKWAGNEKNMVIMPGYCVQGTVGHKIIiSGQRKLEMKGRQVLEVK 
MQVEYMS FSAHADAKG I MQLVGQAE P E S VLLVHGEAKKME FLKQ 
KIEQELRVNCYMPANGETVT1PTSPSIPVGISLGLLKREMAQGL 
LPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTC 
RVHLHDTRKEQETALRVYSH1KSVLKDHCVQHLPDGSVTVESVL 
LQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLKKGLPQAPS 


6216 


11 


3 93 


QTTRPEPP.NSALRQSRSKMAWGVSSVSRLLGRSRPQLGRPMSS 
GAHGEEGSARMWKTLTFFVALPGVAVSMLNVYLKSHHGEHERPE 
FIAYPHLRIRTKPFPWGDGNHTLFh Ii 1 111 l - )t- 


6217 


9 


1173 


trvgrgf.sglkmevkpppgrpqpdsgrrrrrrgeeghdpkepeq 
lrklfigglsfettddslrehfekmgtltdcwmrdpqtkrsrg 
fgfvtyscveevdaamcarphkvdgrwepkravsredsvkpga 
hltvkkifvggikedteeynlrdyfekygkietievmedrqsgk 
krgfafvtfddhdtvdkiwqkyhtinghncevkkalskqemqs 
agsqrgrgggsgnfmgrggnfgggggnfgrggnfggrggygggg 
ggsrgsygggdggyngfggdggnygggpgyssrggyggggpgyg 
nqgggygggggydgyneggnfgggnyggggnyndfgnysgqqqs 
nygpmkggsfggrsSgepygggygsgggsggygsrrf 


62.1 H 


1305 


906 


SCERRGFIMADDLKRFLYKXLPSVEGLHAIVVSDRDGVPVIKVA 
NDNAPEHALRPGFLSTFALATDQGSKLGLSKNKSIICYYNTYQV 
VQFNRL.PLWSFIASSSANTGHVSLEKELAPLFEELRQWEVS 


6219 


2 


890 


AGPGEGAGAGTRCAGAEAEMASAGGEDCESPAPEADRPHQRPFL 
IGVSGGTASGKSTVCEKIMELLGQNEVEQRQRKWILSQDRFYK 
VLTAEQKAKALKGQYNFDHPDAFDNDLMHRTLKNIVEGKIVEVP 
TYDFVTHSRLPETTWYPADWLFEGILVFY3QEIRDMFHLRLF 
VDTDSDVRLSRRVLRDVRRGRDLSQILTQYTTFVKPAFEEFCLP 
TKKYADVIIPRGVDNMVAINLIVQHIQDILNGDICKWHRGGSNG 
RSYKRTFSEPGDHPGMLTSGKRSHLESSSRPH 


6220 


227 


764 


E , FISLEMSCTIEKALADAKAIiVERLRDHDDAAESL^EQTTALN 
KRVEAMKQYQEEIQELNEV7ARHRPRSTLVMGIQQENRQIRELQQ 
ENKELRTSLEEHQSALELIMSKYREQMFRLLMASKKDDPGIIMK 
LKEQHSKIDMVHRNKSEGFFLDASRHILEAPQHGLERRHLEANQ 
NVH 


6221 


98 


915 


RWIWDLNPVSDGLELRPKYNGILHCLTTIWKLDGLRGLYQGVTP 
NIWGAGLSWGLYFVFYNAIKSYKTEGRAERLEATEYLVSAABAG 
AMTLCITNPLWVTKTRLMLQYDAWNSPHRQYKGMFDTLVKIYK 
YEGVRGLYKGFVPGLFGTSHGALQFMAYELLKLKYNQHINRLPE 
AQLSTVEYISVAALSKIFAVAATYPYQWRARLQDQHMFYSGVI 
DVITKTWRKEGVGGFYKGIAPNLIRVTPACCITFWYENVSHFL 
LDLREKRK 
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corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=lsoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=poGsible nucleotide deletion, 
\=possibls nucleotide insertion) 


6222 


2 


2116 


MARELRALLLWGRRLRPLLRAPALAAVPGGKPILCPRRTTAQLG 
PRRNPAWSLQAGRLFSTQTAEDKEEPLHSIISSTESVQGSTSKH 
EFQAETKKLLDIVARSLYSEKEVFIRELISNASDALEKLRHKLV 
SDGQALPEMKIHLQTNAEKGTITIQDTGIGMTQEELVSNLGTIA 
RSGSKAFLDALQNQAEASSKI IGQFGVGFYSAFMVADRVEVYSR 
SAAPGSLGYQWLSDGSGVFE IAEASGVRTGTKI I IHLKSDCKEF 
SSEARVRDWTKYSNFVSFPLYLNGRRMNTLQAIWMMDPKDVRE 
WQHEEFYRYVAQAHDKPRYTLHYKTDAPLNiRSIFYVPDMKPSM 
FDVSRELGSSVALYSRKVLIQTKATDILPKNLRFIRGWDSEDI 
PLNLSRELLQESALZRKLRDVLQQRLIKFFIDQSKKDAEKYAKF 
FEDY3LFMREGIVTATEQEVKEDIAKLLRYESSALPSGQLTSLS 
EYASRMPAGTRNIYYLCAPNRHLAEHSPYYEAMKKKDTEVLFCF 
EQFDELTLLHLREFDKKKLISVETDIVVDHYKEEKFEDRSPAAE 
CLSEKETEE1MAWMRNVLGSRVTNVKVTLRLDTHPAMVTVLEMG 
AARHFLRMQQIAKTQEERAQLLQPTLE INPRHALI KKLNQLRAS 
EPGLAQLLVDQIYENAMIAAGLVDDPRAMVGRLNELLVKALERH 


6223 


3 


715 


1 11 AGMVDFODEEQVKSFLENMEVECKYHCYHKKDPrGCY 
RLVDYLEGIRKNFDEAAKVLKFIJCEENQHSDSCYKLGAYYVTGK 
GGLTQDLKAAARCFLMACEKPGKKS1AACHNVGLLAHDGQWED 
GQPDLGXARDYYTRACDGGYTSSCFNLSAMFLQ3APGFPKDMDL 
ACK Y 5 M I'ACDLGHI WACANASRMYKLGDGVDKVEAKAEVLKNRA 
QQVHKEQQKGVQPLTFG 


5224 


1 


133 


LRTISSKAVJGPLLLTLLAHCTGSHAy VLTQPP3 ^ARIPHEK 


S225 


3259 


938 


LL E J LPE i j ] lTt 
QKEWRLLSPAQRALYREVTLENYSHLVSLGILHSKPELIRRLEQ 
GEVPWHEERRRRPGPCAGIYAEHVLRPKKLGLAHQRQQQLQFSD 
QSFQSDTAEGQEKEKSTKPMAFSSPPLRHAVSSRRRNSWE1ES 
SQGQRENPTEIDKVLKGIENSRWGAFKCAERGQDFSRKMMVIIH 
KKAHSRQKLFTCRECHQGFRDESALLLHQNTHTGEKSYVCSVCG 
RGFSLKANLLRHQRTHSGEKPFLCKVCGRGYTSXSYLTVHERTH 
TGEKPYECQECGRRFNDKSSYNKHLKAHSGEKPFVCKECGRGYT 
NKSYFWHKRIHSGEKPYRCQECGRGFSNKSHLITHQRTHSGEK 
PFACRQCKQSFSVKGSLLRHORTHSGEKPFVCKDCERSFSQKST 
LVYHQRTHSGEKPFVCRECG3GFIQKSTLVKHQITHSEEKPFVC 
KDCGRGFIQKSTFTLHQRTHSEEKPYGCRECGRRFRDKSSYNKH 
LRAHLGEKRFFCRDCGRGFTLKPNLTIHQRTHSGEKPFMCKQCE 
XSFSLKANLLRHQWTHSGERPFNCKDCGRGFILKSTLLFHQKTH 
SGEKPFICSECGQGFIWKSNLVKHQIAHSGKQPFVCKECGRGFN 
WKGNLLTHQRTHSGEKPFVCNVCGQGFSWKRSLTRHHWRIHSKE 
KPFVCQECKRGYTSKSDLTVHERIHTGERPYECQECGRKFSNKS 
YYSKHLKRHLREKRFCTGSVGEASS 






266 


TKVSE_,L3GSQRLFFLPLWRRLCRCGLGPRVSPMAGPRVEVDGS 
IKEGGGQSLRVSTGLSWLLSLPWRAQRIRAGRSYA 


6227 


2581 


890 


MSAS S LLjEQRPKGQGNKVQNGSVHQKDGLNDDDFEPYLS PQARP 
NNAYTAMSDSYLPSYYSPSIGFSYSLGEAAWSTGGDTAMPYLTS 
YGQLSNGEPHFLPDAMFGQPGALGSTPFLGQHGFNFFPSGIDFS 
AWGNNSSQGQSTQSSGYSSKYAYAPSSLGGAMIDGQSAFAMETL 
WKAPGMNTIDQGMAALKLGSTEVASMVPKWGSAVGSGSITSNI 
VASNSLPPATIAPPKPASWADIASKPAKQQPKLKTKNGIAGSSL 
PPPPIKHNMDIGTWDNKGPVAKAPSQALVQNIGQPTQGSEQPVG 
QQANNSPPVAQASVGQQTQPLPPPPPQPAQLSVQQQAAQPTRWV 
APRNRGSGFGHNGVDGNGVGQSQAGSGSTPSEPHPVLEKLRSIN 
NYNPKDFDWNLKHGRVFIIKSYSEDDIHRSIKYNIWCSTEHGNK 
RLDAAYRSMNGKGPVYLLFSWGSGHFCGVAEMKSAVDYNTCAG 
VW SQDKWKGRFDVRWI FVKDVPNSQLRH IRLENKENKPVTNSRD 
T2EVPLEKAKQVLKIIASYKHTTSIFDDFSHYEKRQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 

nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

location 

to first 3 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








CLGQGAD I WAY I LQHVHSQRTVKKIRGNLLWYGHQDS PQVRRKL 
ELEAAVTRLRAEIQELDQSLELMERDTEAQDTAMEQARQHTQDT 
QRRALLLRAQAGAMRRQQHTLRDPMQRLQNQLRRLQDMERKAKV 
DVTFGSLTSAALGLEPWLRDVRTACTLRAQFLQNLLLPQAKRG 
SLPTPHDDHFGTSYQQWLSSVETLLTNHPPGHVLAALEHLAAER 
EAEIRSLCSGDGLGDTEISRPQAPDQSDSSQTLPSMVHLIQEGM 
RTVGVLVSQRSTLLKERQVLTQRLQGLVEEVEREVLGSSERQVL 

LHWRQLVEETQEQVRLLIKGNSASKTRLCRSPGEVLALVQRKW 
PTFEAVAPQSRELLRCLEEEVRHLPHILLGTLLRHRPGELKPLP 
TVLPSIHQLHPASPRGSSF1ALSHKLGLPPGKASELLLPAAASL 
RQDLLLLQDQRSLWCWDLLHMKTSLPPGLPTQELLQIQASCEKQ 
QKENLGQALKRLEKLLKQALERIPELQGIVGDWWEQPGQAALSE 
ELCQGLSLPQWRLRWVQAQGALQKLCS 


5229 


1571 


560 


GPSLLGTRGTPNPARTLQIFFLIIHRRT,TGRI4.AAVDDLQFEEFG 
NAATSLTANPDATTVNIEDPOETPKHQPGSPRGSGREEDDELLG 

LPIPGKNFVRLYIRSNPDLYGPFWICATLVFAIAISGHLSNFLI 
HLGE KTYHYVPEFRKVS IAATI I YAYAWLVPLALWGFLMWRNS K 
VMNIVSYSFLEIVCVYGYSLFIYIPTAILWIIPHKAVRWILVMI 
ALGISGSLLAMTFWPAVREDNRRVALATIVTIVLLKMLLSVGCL 
AYFFDAPEMDHLPTTTATPNQTVAAAKSS 








VGAPVYMAAVIEYLAAEILELAGNAARDNKICARIAPRHILLAVA 
NDEELNQLLKGVTIASGGVLPRIHPELLAKKRGTKGKSETILSP 
PPEKRGRKATSGKKGGKKSKAAKPRTSKKSKPKDSDKEGTSNST 
SEDGPGDGFTILSSKSLVLGQKLSLTQSDISHIGSMRVEGIVHP 
TTAEIDLKEDIGKALEKAGGKEFLETVKELRKSQGPLEVAEAAV 
SQS SGLAAKFVIHCH I PQNGSDKCEEQLEET I KNCLSAAEDKKL 
KSVAFPPFPSGRNCFPKQTAAQVTLKAISAHFDDSSASSLKNVY 
FLLFDSESIGIYVQEMAKLDAK 


6231 


149 


870 


, ! F=.SSTt/DRSLRNVLWSFGFLLLFTAYGGLQSLQSSLYSEEG 
LGVTALSTLYGGMLLSSMFLPPLLIERLGCKGTI ILSMCGYVAF 
SVGNFFASWYTL I PTE I LLGLGAAPLWSAQCTYLTITGNTHAEK 
AGI . "-CDW-NQYFGIFFLIFQSSGVWGNLISSLVFGQTPSQETL 
PEEQLTSCGASDCLMATTTTNSTQRPSQQLVYTLLGIYTGSGVl 
AVLMIAAFLQP IRDVQRESE 


6232 


3679 


1475 


FVAGTTMAGFWVGTAPLVAAGRRGRWPPQQLMLSAALRTLKHVL 
YYSRQCLMVSRNLGSVGYDPNEKTFDKILVANRGEIACRVIRTC 

IMEAIKKTRAQAVHPGYGFLSENKEFARCLAAEDWFIGPDTHA 
IQAMGDKIESKLLAKKAEVNTIPGFDGWKDAEEAVRIAREIGY 
PVMIKASAGGGGKGMRIAWDDEETRDGFRLSSQEAASSFGDDRL 
LIEKFIDNPRHIEIQVLGDKHGNALWLNERECSIQRRNQKWEE 
APS I FLDAETRRAMGEQAVALARAVKYSSAGTVEFLVDS KKNFY 
FLEMNTRLQVEHPVTECITGLDLVQEMIRVAKGYPLRHKQADIR 
INGWAVECRVYAEDPYKSFGLPSIGRLSQYQEPLHLPGVRVDSG 
IQPGSDISIYYDPMISKLITYGSDRTEALKRMADALDNYVIRGV 
THNIALLREVIINSRFVKGDISTKFLSDVYPDGFKGHMLTKSEK 
NQLLAIAS SLFVAFQLRAQHFQENSRMPVI KPD IANWELS VKLH 
DKVHTWASNNGSVFSVEVDGSKLN\ r TSTWHLASPLLSVSVDGT 
. LSREAGGNMSIQFLGTVYKVNILTRLAAELMKFMLEKV 
TJFTL 3VLXSPMPGVWAVSVKPGDAVAEGQEICVIEAMKMQNS 
MTAGKTGTVKSVHCQAGDTVGEGDLLVELE 


6233 




2654 


HSTRENLWAGMFNFPSEGHLVRSTGPGGSFAKHMVAQCVSPKGP 
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SEQ 
ID 


Predicted 

corresponding 
to first 
amino acid 

sequence 


Predicted and 
nucleotide 
location 
corresponding 

amino scid 

residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, K=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LACSRTYFFGATHVPYLGGDSKIiPKKTEQIRLLSQIYAAVIEAV 
LAGIACYAKTSSLTKAKEVAEQTLGSGLDSFELIPFKAALRSKT1 
TFHIHAVNNQGR IVPLDSEDSLS FVKTACMAVYD I PDLLGGNGC 
LGSWFSESFLTSQILVKEKDGTVTTETSSVVLTAAVPRFCSWL 
VEDNEVKLSEKTHQAVR3DESFLGTYLTGGEGAYLYSSNLQSWP 
EEGNVHFFSSGLLFSHCRHGSI IISKDHMNSISFYDGD3TSTVA 
ALLIDFKS S LLPHLPVHFHGS SNFLMIALFPKSKI YQAFYS EVF 
SLWKQQDNSGISLKVIQEDGLSVEQKRLHSSAQKLFSALSQPAG 
EKRSSLKLLSAKLPELDWFLQHFAISSISQ3PVMRTHLPVLLQQ 
AEINTTHRIESDKVIISIVTGLPGCHASELCAFLVTLHKECGRW 
MVYRQIMDSSECFHAAHFQRYLSSALEAQQNRSARQSAYIRKKT 
RLLWLQGYTDVIDWQALQTHPDSNVKAS FTIGAI TACVE PMS 
CYMEHRFLFPKCLDQCSQGLVSNWFTSHTTEQRHPLLVQLQSL 
IRAANPAAAFILAENGIVTRNEDIELILSENSFSSPEMLRSRYL 
MYPGWYEGKLNAGSVYPLMVQICVWFGRPLEKTRFVAKCKAIQS 
SIKPSPFSGNIYHILGKVKFSDSERTMEVCYNTLANSLSIMPVL 
EGPTPPPDSKSVSQDSSGQQECYLVFIGCSIjKEDSIKDWLRQSA 
KQKPQRKALKTRGMLTQQEIRSIHVKRHLEPLPAGYFYNGTQFV 
NFFGDKTDFHPLMDQFMNDYVEEANREIEKYNQELEQQEYHDLF 


S234 


1731 




PRVREDMDHKSPGNKGSLVYAGIKSIVKSSLGMVESSRHNWSGL 
DKQSDIQNLNEERILALQLCGWIKKGTDVDVGPFLNSLVQEGEW 
ERAAAVALFNLDIRRAIQILNEGASSEKGDLNLNWAMALSGYT 
DEKNSLWREMCSTLRLQLNNPYLCVMFAFLTSETGSYDGVLYEN 
KVAVRDRVAFACKFLSDTQLNRYIEKLTNEMKEAGNLEGILLTG 
LTKDGVDLMESYVDRTGDVQTASYCMLQGSPLDVLKDERVQYWI 
ENYRNLLDAWRFWHKRAEFEIHRSKLDPSSKPLAQVFVSCNFCG 
KSISYSCSAVPHQGRGFSQYGVSGSPTKSKVTSCPGCRKPLPRC 
ALCLINMGTPVSSCPGGTKSDEKVDLSKDKKLAQFNNWFTWCHN 
CRHGGHAGHMLSWFRDHAECPVSACTCKCMQLDTTGNLVPAETV 

QP 


6235 




571 


EKRDHRLPSWPRAALKVPGRGGRVGTTPELAAG3IMATRNPPPQ 
DYESDDDSYEVLDLTEYARRHQWWNRVFGHSSGPMVEKYSVATQ 
IVMGGVTGWCAGFLFQKVGKLAATAVGC-GFLLLQIASHSGYVQ1 
DWKRVEKDV1TKAKRQ I KKRANKAAPE INNLIEEATE F IKQNIVI 
SSGFVGGFLLGLAS 


S236 




703 


WDQNKGAAAGSGLTLPSLPSARFSAGPPTQRSRPTMSNMEKHLF 
NLKFAAKELSRSAKKCDKEEKAEBCAKIKKAIQKGNKEVARI'HAE 
NAIRQKNQAVNFLRMSARVDAVAARVQTAVTMGKVTKSMAGWK 
SMDATLKTMWLEKISALMDKFEHQFETLDVQTQQMEDTMSSTTT 
LTTPQNQVDMLLQEMADEAGLDLNMSLPQGQTG3VGTSVASAEQ 
DELSQRLARLRDQV 


S237 




720 


PTAt/AEEGIAAGGVKDVKTALQEVLKTALIHDGLARGIREAAKA 
LDKRQAHLCVLASNCDEPMYVKLVEALCAEHQINLIKVDDNKKL 
GEWVGLCKIDREGKPRKWGCSCVWKDYGKESQAKDVIEEYFK 
CKK 


6238 


2 


4GSS 


EEVPTQESVKWEINVIIKNPEIVFVADMTKNDAPALVITTQCEI 
CYKGNLENSTMTAAIKDLQVRACPFLPVKRKGKITTVLQPCDLF 
YQTTQKGTDPQVIDMSVKSLTLKVSPVIINTMITITSALYTTKE 
TIPEETASSTAHLWEKKDTKTLKMWFLEESNETEKIAPTTELVP 
fCGEKIKMNIDSIFIVLEAGIGHRTVPMLLAKSRFSGEGKNWSSL 
INLHCQLELEVHYYNEMFGVWEPLLEPLEIDQTEDFRPWNLGIK 
MKKKAKMAIVESDPEEENYKVPEYKTVISFHSKDQLNITLSKCG 
LVMLNNLVKAFTEAATGS SADFVKDLAPFMI LNSLGLTI SVSPS 
DSFSVLNIPMAKSYVLKNGESLSMDYIRTKDNDHFNAMTSLSSK 
LFFILLTPVNHSTADKIPLTKVGRRLYTVRHRESGVERSIVCQI 
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SEQ 
ID 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 


Predicted end 

corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=?heny^alanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=G1 ut amine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DTVEGSKKVTIRSPVQIRNHFSVPLSVYEGDTLLGTASPENEFN 
I PhGS YRS F I FLKPEDENYQMCEGI DFEE 1 1 KNDGALLKKKCRS 
KHPSKESFLIN1VPBKDNLTSLSVYSEDGWDLPYIMHLWPPILL 
RNLLPYKIAYYIEGIENSVFTLSEGHSAQICTAQLGKARLHLKL 
LDYLNHDWKSEYHIKPNQQDISFVSFTCVTEMEKTDLDIAVHMT 
YNTGQTWAFHSPYWMVNKTGRMLQYKADGIHRKHPPNYKKPVI. 
FS FQPNHFFIIt\MKVQLMVTDSELSNQFS IDTVGSHGAVKCKGLK 
MDYQVGVT IDLS S FNI TRI VTFTP F YM I KNKS KYH I S VAEEGKD 
KWLSLDLEQCIPFWPEYASSKLLIQVERSEDPPKRIYFNKQENC 
ILLRLDNELGGIIAEVMIAEHSTVITFLDYHDGAATFLLItJHTK 
NELVQYNQSSLSEIEDSLPPGKAVFYTWADPVGSRRLKWRCRKS 
HGEVTQKDDMMMPIDLGEKTIYLVSFFEGLQRIILFTEDPRVFK 
VTYESEKAELAEQEIAVALQDVGISLVNNYTKQEVAYIGITSSD 
WWETKPKKKARWKPMSVKHTEKLEREFKEYTE3SPSEDKVIQL 
DTJWPVRLTPTGHNMKILQPHVIALRRNYLPALKVEYNTSAHQS 
SFRIQIYRIQIQNQIHGAVFPFVFYPVKPPKSVTMDSAPKPFTD 
VSIVMRSAGHSQISRIKYFKVLIQEMDLRLDLGFIYALTDLMTE 
AEVTENTEVELFHKDIEAFKEEYKTASLVDQSQVSLYEYFHISP 
IKLHLSVSLSSGREEAKDSKQNGGLIPVHSLNLLLKSIGATLTD 
VQDWFKLAFFELNYQFHTTSDLQSEVIRHYSKQAIKQMYVLIL 
GLDVLGNPFGLIREFSEGVEAFFYEPYQGAIQGPEEFVEGMALG 
LKAIVGGAVGGL.^GAASKITGAMAKGVAAMTMDEUYQQKRREAM 
NKQPAGFRE [ i L 1 3I\ TB PIKGAQRGGAAG 
FFKGVGKGLVGAVARPTGG 11DMASSTFQG3 KRATETSEVESLR 
PPRFFNEDGVIRPYRLRDGTGNQMLQKIQFYRENIMTHSSSSDD 
DDDDDDDDESDLNH 


6239 


2108 


63 4 


KPGMAGKGSSGRRPLLLGLLVAVATWLVICPYTKVEESFNLQA 
THDLLYHWQDLEQYDHLEFP5WPRTFLGPVVIAVFSSPAVYVL 
SLLEMSKFYSQLIVRGVLGLGVIFGLWTLQKEVRRHFGAMVATM 
FCWVTAMQFHLMFYCTRTLPNVLALPWLLALAAWLRHEWARFI 
WLSAFAIIVFRVELCLFLGLLLLLALGNRKVSWRALRHAVPAG 
ILCLGLTVAVDSYFWRQLTWPEC-KVLWYNTVLNKSSNWGTSPLL 
WYFYSALPRGLGCSLLFIPLGLVDRRTHAPTVIALGFMALYSLL 
PHKELRFI I YAFPMLNI TAARGCSYLLNNYKKSWLYKAGSLLVI 
GHLWNAAYSATALYVSHFNYPGGVAMQRLHQLVPPQTDVLLHI 
DVAAAQTGVSRFLQVNSAWRYDKREDVQPGTGMLAYTHILMEAA 
PGLLALYRDTHRVLASWGTTGVSLNLTQLPPFNVHLQTKLVLL 
ERLPRPS 


6240 


2202 


1176 


HERGDSLKEPTSIAESSRHPSYKSEPSLEPESFRSPTFGKSFHF 
DPLSSGSRSSSLKSAQGTGFELGQLQSIRSEGTTSTSYKSLANQ 
TRNGSLSYDSLLTPSDSPDFESVQAGPEPDPPLGYTSPFLSARL 
AQQREAERHPRLVPTGPTHREPSPVRYDNLSRHIVASLQEREKL 
LRQSPPLPGREEEPGLGDSGIQSTPGSGHAPRTSSSSDDSKRSP 
LGKTPLGRPAVPRFGKPDGLRGRGVGSPEPGPTAPYLGRSMSYS 
SQKAQPGVSETEEVALQPLLTPKDEVQLKTTYSKSNGQPKSLGS 
ASPGPGQPPLSSPTRGGVKKVSGVGGTTYEISV 


6241 


3 


1341 


RNAEEKKRLSLQREKIIARVSIDNRTRALVQALRRTTDPKLCIT 
RVEEI,TFHLLEFPEGKGVAVKERIIPYLLRLRQIKDETLQAAVR 
EILALIGYVDPVKGRG1R1LSIDGGGTRGWALQTLRKLVELTQ 
KPVHQLFDYICGVSTGAILAFMLGLFHMPLDECEELYRKLGSDV 
FSQNVIVGTVKMSWSHAFYDSQTWENILKDRMGSALiMIETARNP 
TCPKVAAVSTIVNRGITPKAFVFRNYGHFPGINSHYLGGCQYKM 
WQAIRASSAAPGYFAEYALGKDLHQDGGLLLNNPSAIiAMHECKC 
LWPDVPLECIVSLGTGRYESDVRNTVTYTSliKTKLSNVINSATD 
TEEVHIMLDGLLPPDTYFRFNPVMCENIPbDESRNEKLDQLQLE 
GLKY I ERNEQKMKKVAKILSQEKTTLQKINDWI KLKTDMYEGLP 
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SEQ 
ID 
NO: 


beginning 
nucleotide 

corresponding 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histidinc, I=Isoleucine, K-Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glut amine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








FFSICj 


6242 


198 


1310 


-. STWSPGAAVCTARRFPGRSLAAFPRPAAPRRAVEMGE 
SSSDIDQMFSTLLGEMDLLTQSLGVDTLPPPDPNPPRAEFNYSV 
GFKDLNESLMALEDQDLDALMAnT.VADXSEAEQRTlQAQKESLQ 
NQHHSASLQASIFSC-AASLGYGTNVAATGISQY3DELPPPPADP 
VLDLPLPPPPPEPLSQEEEEAQAKADKIKLALEKLKEAKVKKLV 
VKVHMNDNSTKSLMVDERQLARDVLDNLFEKTHCDCNVDWCLYE 
IYPELQIERFFEDHENWEVLSDWTRDTENKILFLEKEEKYAVF 
KNPQNFYLDMRGKKESKETNEKMNAKNKESLLEVRLILQSGRKE 
KDVCSIFKSFASENNGKI 


6243 


1509 


614 


RSASRFSGCWSRDSTCCCCPSTCWSRSSASCPRARWPPSSAPAT 
TSRASSRRLACGPQTRAGAETRSTAMIRANSAARDTRRATCRSA 
AGTPSPTTMTCLTDVPTGCAAVEPTARLPAAAWASTITTGCCPA 
MGQAGAGPAGR KG SEAGGG PGRAHHAHPS PL P RE PRVRTGP PAK 
SPTPGSIDPSPELSWGSAGVTQESPLLDPVDFLLFRTRAVDPLR 
RVFFFFYQHLTFFSIQPQPPPCHAFHPRDPPAGTKRQLILVPLK 
GPPILAPILSLTPILSRWSCYFPRSRIAQGWHLS 


6244 


2119 


1745 


EH r'ASJFGTFLGNNESERCKLKLQQKTMSLWSWVNQPSlilLI:;?: 
FTNPLFEANNLVIWPSVAPQ3LPLWEGIFLRWNRSSKYLDEAYE 
EMVNIIEYNf E LQAI VTTILF QLAELETEDGMQESP 


6245 


81 


1148 


T.SI.RNAKYSFPQELISLFSMTDLNDNICKRYIKMITNIVILSLI 
ICISLAFWIISMTASTYYGNLRPISPWRWLFSVVVPVLIVSNGL 
KKKSLDHSGALGGLWGFILTIANFSFFTSLLMFFLSSSKLTKW 
KGEVKKRLDEEYKEGGQRNWVQVFCIJGAVPTELALLYMIENGPG 
E IPVDFS KQYSASWMCLSLLAALACSAGDTOASEVGPVLS KS S P 
RLI TTWE KVPVGTNGRVT WGLVSSLLGGTFVG IAYFLTQL I FV 
NDLDISAPQKPIIAFGGLAGLLGSIVDSYLGATMQYTGLDESTG 
MVWSPTNKARHIAGKPILDNNAVNLFSSVLIALLLPTAAWGFW 
PRG 


624S 




359 


SLWPWILMDDSLMQISLQLLCVYTANFPNGCSSLCWSSCGQHPV 
QATHRGAVSNSLMLCILKLASQMPLENTTVQQMVFMLLSNLALS 
HDCKGVIQKSWFLQNFLSLALPKGGNKHLSNLTILWLKLLLNIS 
SGEDGQQMILRLDGCLDLLTEMSKYKHKSSPLLPLLIFHNVCFS 
PANKPKI LANEKVI TVLAACLESENQHAQRIGAAALWALI YNYQ 
KAKTALKSPSVKRRVDEAYSIAKKTFPNSEANPLNAYYLKCLEN 
LVQLLNSS 


6247 




1678 


NSRVWGPWTEPSAGSLRPMARKQNRNSKELGLVPLTDDTSHAGP 
PGPGRALLECDHLRSGVFGGRRRKDWSCSLLVASLAGAFGSSFL 
YGYNLSWNAPTPYIKAFYNESWERRHGRPIDPDTLTI.TiWSVTV 
SIFAIGGLVGTLIVKMIGKVLGRKHTLLANHGFAISAALLMACS 
LQAGAFEMLIVGRFIMGIDGGVALSVLPMYLSEISPKEIRGSLG 
QVTAI FI CIGVFTGQLLGLPELL3KESTWPYLFGVI WPAWQL 
LSLPFLPDSPRYLLL3KHNEARAVKAFQTFLGKAHVSQEVEEVL 
A33RVQRSIRLVSVL3LLRAPYVRWQVVTVIVTMACYQLCGLNA 
I WFYTNS I FGKAG I P PARI PYVTLSTGG XETLAAVFSGLVI EHL 
GRRPLLIGGFGIiMGLFFGTLTI TLTLQDHAPWVP YLS I VGI LAI 
I AS FCSGPGGI P F I LTGEFFQQSQRPAAF I IAGTVTJWLSNFAVG 
LLFPFIQKSLDTYCFLVFATICITGAIYLYFVLPETKNRTYAEI 
SQAFSKRNKAYPPEEKIDSAVTDGKINGRP 


6248 


56 


1773 


' E ^HriAAVPPGLEPWNRVRIPKAGNRSAVTVQNPGAALDLCI 
AAVIKECHLVILSLKSQTLDAETDVLCAVLYSNHNRMGRHKPHL 
ALKQVEQCLKRLKNMNLEGSIQDLFELFSSNENQPLTTKVCWP 
SQPWELVLMKVLGACKLLLRLLDCCCKTFLLTVKHLGLQEFII 
LNLVMVGLVSRLWVLYKGVLKRLILLYEPLFGLLQEVARIQPMP 
YFKDFTFPSDITEFLGQPYFEAFKKKMPIAFAAKGINKLLMKLF 
LINEQSPRASEETLLGISKKAKQMKINVQNNVDLGQPVKNKRVF 
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Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
KEESSEFDVRAFCNQLKHKATQETSFDFKCSQSHLKTTKYSSQK 
VIGTPHAKSFVQRFREAESFTQLSEEIQMAVWCRSKKLKAQAI 
FLGNKLLKSNRLKHLEAQGTSLPKKLEC I KTS I CNHLLRGSGI K 
7SKHHLRQRRSQNKFLRRQRKPQRKLQSTLLREIQQFSQGTRKS 
ATDTSAKWRLSHCTVHRTDLYPNSKQLLNSGVSMPVIQTKEKMI 
HENLRGIHENETDSWWMQINKNSTSGTIKETDDIDDIF AIMGV 
VPPPRMMAAVPPGLEPWNRVRIPKAGNRSAVTVCNPGAALBLCI 
AAVIKECHLVILSLKSQTLDAETDVLCAVLYSNHNRMGRHKPHL 
ALKQVEQCLKRLKNMNLEGSIQDLFELFSSNENQPLTTKVCVVP 
SQPWELVLMKVLGACKLLLRLLDCCCKTFLLTVKHLGLQEFII 
LNLVMVGLVSRLWVLYKGVLKRLILLYEPLFGLLQEVARIQPMP 
YFKDFTFPSDITEFLGQPYFEAFKKKKPIAFAAKGINKLLNKLF 
LINEQSPRASEETLLGISKKAKQMKINVQNWDLGQPVKNKRVF 
KBESSEFDVRAFCNQLKHKATQETSFDFKCSQ3RLKTTKYSSQK 
VIGTPHAKSFVQRFREAESFTQLSEEIQMAVVWCRSKKLXAQAI 
FLGNKLLKSNRLKHLEAQGTSLPKKLECIKTSICNHLLRGSGIK 
TS KHHLRQRRSQNKFLRRQRKPQRKLQSTLLRE IQQFSQGTRKS 
ATDTSAKWRLSHCTVHRTDLYPNSKQLLNSGVSMPVIQTKEKMI 
HENLRSIHENETDSWTVMQINKNSTSGTIKETDDIJJDIFALMGV 
LAALHIKALPFRKDLEKYKDLDEDELLGNLSETELKQLETVLDD 
DDPENALLPAGFRQKNQTSKSTTGPFDREHLLSYr.EKEALEHKD 
REDYVPYTGEKKGKIFIPKQKPVQTFTEEKVSLDPELEEALTSi 
SDTEIiCDIiAAILGMHNLITNTKFCNIMGSSNGVDQEHFSNWKG 
EKILPVFDEPPNPTNVEESLKRTKENDAHLVEVNLNNIKNIPIP 
TLKDFAKALETNTHVKCFSLAATRSNDPVATAFAEMLKVNKTLK 
S LWVE SNF I TGVG I LAL I DALRDNETLAELK I DNQRQQLGTAVE 
LEMAKMLEENTNILKFGYQFTQQGPRTRAANAITKNNDLVRKRR 
VEGDHQ 

TPGSGPMSAWAAASLSRAAARCLLARGPGVRAAPPRDPRPSHPE 
PRGCGAAPGRTLHFTAAVPAGHNKWSKVRHIKGPKDVERSRII 
KLCLNIRLAVKEGGPWPEHNSNLANILEVCRSKHMPKSTIETAL 
KMEKSKDTYLLYEGRGPGGSSLLIEALSKSSHKCQADIRHIIjNK 
NGGVI-1AVGARHSFDKKGVIWEVEDREKKAVMLERALEMAIEAG 
AEDVKETEDEEERIWFKFICDASSLHQVRKKLDSLGLCSVSCAL 
EFIPNSKVQLAEPDLEQAAHLIQALSHHEDVIHVYDNIE 
EEFCTOIAVRVGEMETAPKPGKDVPPKKDKLQTKRKKPRRYWEE 
ETVPTTAGASPGPPRNKKNRELRPQRPKNAYILKKSRISKKPQV 
PKKPREWKNPESQRGLSGAQDPFPGPAPVPVEWQKFCRIDKSR 
KLPHSKAKTRSRLEVAEAEEEETSIKAARSELIiLAEEPGFLEGE 
DGEDTAKICQADIVEAVDIASAAKHFDLNLRQFGPYRLNYSRTG 
RHLAFGGRRGHVAALDWVTKKLMCEUST7MEAVRDIRFLHSEALL 
AVAQNRWLHIYDNQGIELHCIRRCDRVTRLEFLPFHFLLATASE 
TGFLTYLDVSVGKIVAALNARAGRLDVHSQNPYNAVIHLGHSNG 
TVSLWSPAMKEPLAKILCHRGGVRAVAVDSTGTYMATSGLDHQL 
KI FDLRGT YQ PLS TRTL PHGAGHLAFS QRGLLVAGMGDVWI WA 
GQGKASPPSLEQPYLTHRLSGPVHGLQFCPFEDVLGVGHTGGIT 
SMLVPGAGEPNFDGLESNPYRSRKQRQEW3VKALLEKVPAELIC 
LDPRALAEVDVISLEQGKKEQIERLGYDPQAKAPFQPKPKQKGR 
SSTASLVKRKRKVMDEEHRDKVRQSLQQQHHKEAKAKPTGARPS 

ALDRFVR 

EEFCTWIAVRVGEMETAPKPGKDVPPKKDKLQTKRKKPRRYWEE 
ETVPTTAGASPGPPRNKKNRELRPQRPKNAYILKKSRISKKPQV 
PKKPREWKNPESQRGLSGAQDPFPGPAPVPVEWQKFCRIDKSR 
KLPHSKAKTRSRLEVAEAEEEETSIKAARSELLLAEEPGFLEGE 
DGEDTAKICQADIVEAVDIASAAKHFDLNLRQFGPYRLNYSRTG 
RHIiAFGGRRGHVAALDWVTKKLMCEINVMEAVRDIRFLHSEALL 
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NO: 
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nucleotide 

corresponding 
to first 
amino acid 

amino acid 
sequence 


Predicted enc 
nucleotide 
location 
corresponding 

amino acid 

amino acid 


Amxno acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 

p=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








AVAQNRWLHIYDNQGIELHCIRRCDRVTRLEFLPFHFLLATASE 
TGFLTYLDVSVGKIVAALNARAGRLDVMSQHPYNAVIHLGHSNG 
TVS'LWSPAMKEPLAKILCHRGGVRAVAVDSTGTYMATSGLDHQL 
KIFDLRGTYQPLSTRTLPHGAGHLAFSQRGLLVAGMGDWNIWA 
GQGKASPPSLEQPYLTHRLSGPVHGLQFCPFEDVLGVGHTGGIT 
SMLVPGAGEPNFDGLESNPYRSRKQRQEWEVKALLEKVPAELIC 
LDPRALAEVDVISLEQGKKEQIERL3YDPQAKAPFQPKPKQKGR 
SSTASLVKRKRKVMDEEHRDKVRQSLCQQHHK2AKAKPTGARPS 


6254 


155 


113 9 


HALGRRGGSQELSAAACGCFALRLRAPGSGRPALAFGAAAFAGL 
GGAPRFPPRGSAAGRTMLLKEYRICMPLTVDEYKIGQLYMISKH 
SHEQSDRGEGVEWQNEPFEDPHHGNGQFTEKRVYLNSKLPSWA 
RAWPKIFYVTEKAWNYYPYTITEYTCSFLPKFSIHIETKYEDN 

SEKTGRGQLREGWRDSHQPIMCSYKLVTVKFEVWGLQTRVEQFV 
HKWRDILLIGHRQAFAWVDF.WYDMTMDDVREYEKNMHEQTNIK 
Ii tSPVDDIESHAQTST 


S255 


1 




PTRPQQELLVSLATVIFVASQKALSVESKAVIKQQLESVSNGWT 
VYRIARQASRMGNHDMAKELYQELLTQVASKIIFYFHLNSLKEFS 
HAEQCLTGLQEENYSSALSCIAESLXFYHKGIASLTAASTPLNP 
LSFQCEFVKLRIDLLQAFSQLICTCNSLKTSPPPAIATTIAMTL 
GKDLQRCGRISNQMKQSM3EFRSLASRYGDLYQASFDADSATLR 
NVELQQQSCLLISHAIEALILDPESASFCEYGSTGTAHADS2YE 
RRMMSVYNHVLEEVESLNGKYTPVSYMHTACLCNAIIALLKVPL 
SFQRYFFQKLQSTSIKLALSPSPRNPAEPIAVQNNQQLALKVEG 
WQHGSKPGLFRKI QS VCLNVSSTLQSKSGQDYKIPIDNMTNEM 
EQRVEPHNDYFSTQFLLNFAI LGTHNI TVES S VKDANG I VWKTG 
PRTTIFVKSLEDPYSQQIRLQQQQAQQPLQQQQQRNAYTRF 


6256 


1 


1542 


crgagaepaanprsprslvpsleststsvppapgtmatdswalia 

edraaqsllnklirsnlvdntnqvevlqrdpnsplysvksfeel 
rlkpqllqgvyamgfnrpskiqenalplmlaeppqnliaqsqsg 
tgktaafvlamlsqvepankypqclclsptyelalqtgkvieqm 

fidpkkikvfvldeadvmiatqghqdqsiriqrmlprncqmllf 
satfedsvwkfaqjcwpdpnvikiikreeetldtikqyyvlcssr 
dekfqalcnlygaitiaqamifchtrktaswlaaelskeghqva 
llsgemmveqraavierfregkekvlvttnvcargidveqvsw 
infdlpvdkdgnpdnetylhrigrtgrfgkrglavnmvdskhsm 
nilkriqehfnkkierldtddldeiekim 


6257 


21D 


615 


AFIPAMAELIQKKLQGEVEKYQQLQKDLSKSMSGRQKLEAQLTE 
NNIVKEELALLDGSNWFKLLGPVLVKQELGEARATVGKRLDYI 
TAEIKRYESQLRDLERQSEQQRETLAQLQQEFQRAQAAKAGAPG 
KA 


6258 


210 


615 


AFIPAMAELIQKKLQGEVEKYQQLQKDLSKSMSGRQKLSAQLTE 
WNIVKEELALLDGSNWFKLLGPVLVKQELGEARATVGKRLDYI 
TAEIKRYESQLRDLERQSEQQRETLAQLQQEFQRAQAAKAGAPG 
KA 


6259 


2 


1540 


I ij'SKGFPSQCHPERKWKVDDVLESSQEWEDDHFWELLFHWNKTV 
S VENGDRGSKTFMLGTDPVSLRNYPYKI CDS CEMNLKNISGL 1 1 
S KKNCSRKKPDE FNVCEKLLLD IRHEKIP IGEKS YKYDQKRNAI 
HYHQDLSQPSFGQSFEYSKNGQGFHDEAAFFTNKRSQIGETVCK 
YNECGRTF IESLKLNI SQRPHLEMEP YGCS I CGKS FCMKLRFGH 
QRALTKDNPYEYNEYGE1FCDNSAFIIHQGAYTRKILREYKVSD 
1 1 1 1 SALLKHQIVHMGGKSYDYNENGSNFSKKSHLTQLRRAHT 
GEKTFECGECGKTFWEKSNLTQHQRTHTGEKPYECTECGKAFCQ 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=31ycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginxne, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








KPHLTNHQRTHTGEKPYECKQCGKTFOTKSNLTEHQRTHTGEKP 
YECNACGKSFCHRSALTVHQRTHTGEKPFICNECGKSFCVKSNL 
IVHQRTHTGEKPYKCNECGKTFCEKSALTKHQRTHTGEKPYECN 
ACGKTFSQRSVLTKHQRIHTRVKALSTS 


S260 


2081 




GTGPEIHACAHASARAPGSRAMALRELKVCLLGDTGVGKSSIVW 
RFVEDSFDPNINPTIGASFMTKTVQYQKELHKFLIWDTAGQERF 
RALAPMYYRGSAAAIIVYDITKEETFSTLKNOTKELRQHGPPNI 
WAIAGNKCDLIDVREVT-5ERDAKDYADSIHAIFVETSAKWAINI 
NELFIEISRRIPSTDANLPSGGKGFXLRRQPSEPKRSCC 


6261 






FWYRLGPGTRSRWPRRGSWAASLVPRGPSPAALVTSPCPPDPLR 
SPACEPCRPDFAPRPALLLRSGPRSAPAVTGKPALKGQPGPWPG 
MAEVSIDQSKLPGVKEVCRDFAVLEDHTLAHSLQEQEIEHHLAS 
NVQRNRLVQHDLQVAKQLQEEDLKAQAQLQKRYKDLEQQDCEIA 
QEIQEKLAIEAERRRIQEKKDEDIARLLQEKELQEEKKRKKHFP 
EFPATRAYADSYYYEDGGMKPRVMKEAVSTPSRMAHRDQEWYDA 
EIARKLQEEELLATQVDMRAAQVAQDEEIARLLMAEEKKAYKKA 
I E E otLDKRKQDPEWKPKTAKAANSKSKESDEPHIISKHERPA 
RPPPPIMTDGEDADYTHFTNQQSSTRHFSKSESSHKGFHYKH 


5262 


2 


1759 


r EC :H 3 Q CS VHRPGKVPQARMSGLVLGQRDE PAGHRLSQEE Ilj 
GSTRLVSQGLEALRSEHQAVLQSLSQTIECLQQGGHEEGLVHEK 
ARQLRRSMEWIELGLSKAQVMLALASHLSTVESEKQKLRAQVRR 
LCQENQWLRDELAGTQQRLQRSEQAVAQLEEEKICHLEFLGQLRQ 
YDEDGHTSEEKEGDATKDSLDDLFPNEEEEDPSNGLSRGQGATA 
AQQGGYEIPARLRTLHNLVIQYAAQGRYEVAVPLCKQALEDLER 
TSGRGHPDVATMLNILALVYRDQNKYKEAAHLLNDALSIRESTL 
GPDHPAVAATLNNLAVLYGKRGKYKEAEPLCQRALEIREKVLGT 
NHPDVAKQLNNLALLCQNQGKYEAVERYYQRALAIYEC-QLGPDN 
PNVARTKNKLAS C YL KQGKYAEAE TLYKE I LTRAH VQE FGS VDD 
DHKPIWMHAEEREEMSKSRHHEGGTPYAEYGGWYKACKVSSPTV 
NTTLRNLGALYRRQGKLEAAETLEECALRSRRQGTDPISQTKVA 
ELLGESDGRRTSQEGPGDSVKFEGGEDASVAVEWSGDGSGTLQR 
SGSLGKIRDVLRR 


G2C3 




2408 


RELDSLADLPERIKPPYANGLSTSHLRSSSVEDVKLIISEGRPT 
IEVRRCSMPSVICEHTKQFQTISEESNQGSLLTVPGDTSPSPKP 
EVFSNVPERDLSNVSNIHSSFATSPTGASNSKY.VSADRNLIKNT 
APVJNTVMUSPVHLEPSSQVGVIQNKSWEMPVDRLETLSTRDFIC 
PNSNIPDQESSLQSFCNSENKVLKENADFLSLRQTELPGNSCAQ 
DPASFMPPQQPCSFPSQSLSDAESISKHMSLSYVANQEPGILQQ 
KNAVQIISSALDTDNESTKDTENTFVLGDVQKTDAFVPVYSDST 
IQEASPNFEKAYTLPVLPSEKDFNGSDASTQLNTHYAFSKLTYK 
SSSGHEVENSTTDTQVISHEKENKLESLVLTHLSRCDSDLCEMN 

QSTQPEMHKYGQLVKVELEENAEDDKTENQIPQRMTRNKANTMA 
NQSKQILASCTLLSEKDSESSSPRGRIRLTEDDDPQIHHPRKRK 
VSRVPQPVQVSPSLLQAKEKTQQSLAAIVDSLKLDEIQPYSSER 
ANPYFEYLHIRKKIEEKRKLLCSVIPQAPQYYDEYVTFNGSYLL 
DGNPLSKICIPTITPPPSLSDPLKELFRQQEWRMKLRLQHSIE 
REKLIVSNEQEVLRVHYRAARTLANQTLPFSACTVLLDAEVYNV 
PLDSQSDDSKTSVRDRFNARQFMSWLQDVDDKFDKLKTCLLMRQ 
QHEAAALNAVQRLEWQLKLQELDPATYKSISIYEIQEFYVPLVD 
VNDDFELTPI 


62S4 


143 




KHRQENNALDMAPEIHMTGPMCLIENTHGELVANPEALKILSAZ 
TQPWWAIVGLYRTGKSYLMNKLAGKHKGFSLGSTVKSHTKGI 
WMWCVPHPKKPEHTLVLLDTEGLGDVKKGDNQNDSWIFTLAVLL 
S S TLVYNS MGTI NQQAMDQLY YVTELTHR I RS KS S PDENENEDS 
ADFVSFFPDFVWTLRDFSLDLEADGQPLTPDEYLEYSLKLTQGT 
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location 
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amino acid 


Predicted end 
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amino acid 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glynine, 
H=Histidine, I=Isoleucine, K=Lysine, 

P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SQKDKN FNL PRLC I RKF FP KKKC F VFDL P I HRRKLAQLEKLQDE 
ELDPEFVQQVADFCSYIFSNSKTKTLSGGIKVNGPRLESLVLTY 
INAISRGDLPCMENAVLALAQIENSAAVQKAIAKYDQQMGQKVQ 
LPAETLQELLDLHRVSEREATEVYMKNSFKDVDELFQKKLAAQL 
DKKRDDFCKQNQEASSDRCSALLQVIFSPLEEEVKAGIYSKPGG 
YCLFIQKLQDLEKKYYEEPRKGIOAEEILQTYLKSKESVTDAIL 
QTDQILTEKEKEIEVECVKAESAQASAKMVEEMQIKYQQMMEEK 
EKSYQEHVXQLTEXMERERAQLLEEQEX7LTSKLQEQARVLKER 
CQGESTQLQNEIQKLQKTLKKKTKRYMSHKLKI 


£265 


143 




KHRQENNALDMAPEIHMTGPMCLIENTNGELVANPEALKILSAI 
TQPAA7WAIVGLYRTGKSYLMNKLAGKNKGFSLGSTVKSHTKGI 
WMWCVPHPKKPEHTLVLLDTEGLGDVKKGDNQNDSWIFTLAVLL 
SSTLVYNSMGTINQQAMDQLYYVTELTHRIRSKSSPDEKTEKEDS 
ADFVSFFPDFVWTLRDFSLDLEADGQPLTPDEYLEYSLKLTQGT 
SQKDKNFNLPRLCIRKFFPKKKCFVFDLPIHRRKLAQLEKLQDE 
ELDPEFVQQVADFCSYIFSNSKTKTLSGGIKVNGPRLESLVLTY 
INAISRGDLPCMENAVLALAQIENSAAVQKAIAHYDQQMGCKVQ 
LPAETLQE LLDLHRVS E REATE V YMKKS FKDVDHLFQKKLAAQL 
DKKRDDFCKQNQEASSDRCSALLQVIFSPLEEBVKAGIYSKPGG 
YCLFIQKLQDLEKKYYEEPRKGIQAEEILQTYLKSRESVTDAIL 
QTBQILTEKEKEIEVECVKAESAQASBKMVEKMQIKYQQMMEEK 
EKSYQEHVKQLTEKMERERAQLLEEQEKTLTSECLQEQARVLKER 
CQGESTQLQNE1QKLQKTLKKKTKRYMSHKLKI 


6266 


276 


1421 


> H( :CMLVPCFLYSLQNRKPSLYGSLTCQGIGLDGIPEVTASE 
GFTVNEINKKSIHISCPKENASSKFLAPYTTFSRIHTKSITCLD 
ISSRGGLGVSSSTDGTMKIWQASNGELRRVLEGHVFDVNCCRFF 
PSGLWLSGGMDAQLKIWSAEDASCWTFKGHKGGILDTAIVDR 
GRNWSASRDGTARLWDCGRSACLGVLADCGSSINGVAVGAADN 
SINLGSPEQMPSEREVGTEAKMLLLAREDKKLQCLGLQSRQLVF 
LF IGSDAFNCCTFLSGFLLLAGTQDGN I YQLDVRS PRAPVQVIH 
RSGAPVLSLLSVRDGFIASQGDGSCFIVQQDLDYVTELTGADCD 
PVYKVATWEKQIYTCCRDGLVRRYQLSDL 








LGMMKKNNSAKRCPQDGNQQPAPPEKVGWTOKFCGKGIFREIWK 
NRYWLKGDQLYISEKEVKDEKNIQEVFDLSDYEKCEELRKSKS 
RSKKNHSKFTIiAHSKQPGNTAPNLIFLAVSPEEKESWINAIjNSA 
ITRAKNRILDEVTVEEDSYLAHPTRDRAKIQHSRRPPTRGHLMA 
VASTS TSDGMLTLDL IQEEDPS PEEPTSLC 


6268 


«. 


13S8 


HRELCQNLPAGLSSALIDNPLTLLLSIDTYVMLQEPVTFQDVAV 
DFSREEWGLLGPTQRTEYRDVMLETFGHLVSVGWETTLENKELA 
PNSDIPEEEPAPSLKVQESSRDCALSSTLEDTLQ3GVQEVQDTV 
LKQMESAQEKDLPQKKHFDNRESQANSGALDTNQVSLQKIDNPE 

KSCEROKAKEGNGCRKTFSRSTKQITFIRIHKGSQVCRCSECGK 
IFRNPRYFSVHKKIHTGERPYVCQDCGKGFVQSSSLTQHQRVHS 
GERPFECQECGRTFNDRSAISQHLRTHTGAKPYKCQDCGKAFRQ 
SSHLIRHQRTHTGERPYACNKCGKAFTQSSHLIGHQRTHNRTKR 
KKKQPTS 


6269 


2886 


1449 


HASAPTRRNMAAASPLRDCHAWKDARLPLSTTSNEACKLFDATL 
TQYVKWTNDKSLGGIEGCLSKLKAADPTFVMGHAMATGLVLIGT 
GSSVKLDKELDLAVKTMVEISRTQPLTRREQLHVSAVETFANGN 
FPICACELWEQILQDHPTDMLALKFSHDAYFYLGYQEQMRDSVAR 
I YPFWTPD I PLSS YVKG I YS FGLMETNFYDQAEKIAKEALS INP 
TDAWSVHTVAHIHEMKAEIKDGLEFMQHSETLWKDSDMLACHNY 
WHWALYLIEKGEYEAALTIYDTHILPSLQANDAMLDWDSCSML 
YRLQMEGVSVGQRWQDVLPVARKHSRDHILLFNDAHFLMASLGA 
HDPQTTQELLTTLRDASESPGENCQHLLARDVGLPLCQALVEAE 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginins, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DGNPDRVLELLLPIRYRIVQLGGSNAQRDVFNQLLIHAALNCTS 
SVHKNVARSLLMERDALKPNSPLTERLIRKAATVHLMQ 


6270 


23 


2086 


SVTVTLGSEGDGRPPTYHLEEMEQEPQNGEPAE1KIIREAYKKA - 

GPGWESARQMQQKMKETLQNVRTRLEILEKGLATSLQNDLQEVP 
KLYPEFPPKDMCEKLPEPQSFSSAPQHAEVNGNTS7PSAGAVAA 
PAS LSLP SQS CPAEAP PAYTPQAAEGHYTVS YGTDSGE FSS VGE 
EFYRNHSQPPPLETLGLDADELILIPNGVQIFFWPAGEVSAPS 
YPGYLRIVRFLDNSLDTVLNRPPGFLQVCDWLYPLVPDRSPVLK 
CTAGAYMFPDTMLQAAGCFVGWLSSELPEDDRELFEDLLRQMS 
DLRLQANWNRAEEENEFQIPGRTRPSSDQLKEASGTDVKQLDQG 

EKPVEVSPAVTKGLYIAKQATGGAAKVSQFLVDGVCTVANCVGK 
ELAPHVKKHGSKLVPESLKKDKDGKSPLDGAMWAASSVQGFST 
VWQGLECAAKCIVNNVSAETVQTVRYKYGYNAGEATHHAVDSAV 
NVG VTAYNI KIN IG I KAMVKKTATQTGHTLLEDYQIVDN9 QRENQ 
EGAANVNVRGEKDEQTKEVKEAKKKDK 


6271 


32 


1058 


GCGVKTAGMVGREKELSIHFVPGSCRLVEEEVNIPNRRVLVTGA 
TGLLGRAVHKEFQQNNWHAVGCGFRRARPKFEQVNLLDSNAVHH 
IIIIDFQPHVIVHCAAERRPDWENQPDAASQLNVDASGNLAKEA 
AA7GAFLIYISSDYVFDGTNPPYREEDIPAPLNLYGKTKLDGKK 
AVLENNLGAAVLRIPILYGEVEKLESSAVTVMFDKVQFSNKSAN 
MDHWQQRFPTHVKDVATVCRQLAEKRMLDPSIKGTFHWSGNEQM 
TKYEMACAIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKL 
ETLGIGQRTPFRIGIKESLWPFLIDKRWRQTVFH 


6272 


113 6 


528 


GAVMEDAAAPGRTEGVLERQGAPPAAGQGGALVELTPTPGGLAL 
VSPYHTHRAGDPLDLVALAEQVQKADEFIRANATNKLTVIAEOI 
QHLQEQARKVLEDAHRDANLHHVACNIVKKPGNIYYLYKRESGQ 
QYFSIISPKEWGTSCPHDFLGAYKLQHDLSWTPYEDIEKQDAK1 
SMMDTLLSQSVALPPCTEPNFQGLTH 


6273 


256 


843 


SCPRVSPECRSLGCQVMFSLPLNCSPDHIRRGS GWGRPQDLKIA 
SAAWNSKCHPGAGAAMARQHARTLWYDRPRYVFMEFCVEDSTDV 
HVLIEDHRIVFSCKNADGVELYNEIEFYAKVNSKDSQDKRSSRS 
ITCFVRKWKEKVAWPRLTKEDIKPVWLSVDFDWWRDWEGDEEME 
LAHVEHYAEVRDNTYCVLPT 


6274 


56 


1142 


AAAAMAAAAGGGAGAARSLSRFRGCJoAGALLGDCVGSFYEAHDT 
VDLTSVLRHVQSLEPDPGTPGSERTEALYYTDDTAMARALVQSL 
LAKEAFDEVDMAHRFAQEYKKDPDRGYGAGWTVFKKLLNPKCR 
DVFE PARAQFNGKGS YGNGGAMRVAG I SLAYS SVQDVQKFARLS 
AQLTHASSLGYNGAILQALAVHLALQGESSSKHFLKQLLGHMED 
LEGDAQSVLDARELGMEERPYSSRLKKIGELLDQASVTREEWS 
ELGNGIAAFESVPTAIYCFLRCMEPDPEIPSAFNSLQRTLIYSI 
SLGGDTDTIATMAGAIAGAYYGMDQVPESWQQSCEGYEETDIIA 
QSLHRVFQKS 


6275 


20 


565 


SRRGRARCLARGSRRPVPRPAKTMAFMVKTMVGGQLKNLTGSLG 
GGEDKGDGDKSAAEAQGMSREEYEEYQKQLVEEKMERDAQFTQR 
.-LRSHFRDKYRLPKNETDESQIQMAGGDVELPRELAKMI 
EEDTEEEEEKASVLGQLASLPGLNLGSLKDKAQATLGDLKQSAE 
KCHVN 


6276 


797 


97 


TLLPLPPLPDTEGMILLNTGLEGTVAENPVPIVHTPSGNILTLE 
SCLQQLATHPGHWGIHLQIAEPAALRPSLALLARLSSLGLLHWP 
VWVGAKISHGSFSVPGHVAGRELLTAVAEVFPHVTVAPGWPEEV 
LGSGYREQLLTDMLELCQGLWQPVSFQMQAMLLGHSTAGAIGRL 
LASSPRATVTVEHNPAGGDYASVRTALLAARAVDRTRVYYRLPQ 
GYHKDLLAHVGRK 
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) acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine ( G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=I.eucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosir_e, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
^possible nucleotide insertion) 



2744 MAFRTEMGLYYSYFKTIVEAPSFLNGVWMIMNDKLTEYPLVINT 
LKRFNLYPEVILASWYRIYTKIMDLIGIQTKICWTVTIGEGLSP 
TESCEGLGDPACFYVAVIFILNGLMMALFFIYGTYLSGSRLGGL 
VTVLCFFFNHGECTRVMWTPPLRESFSYPFLVLQMLLVTHILRA 
TKLYRGSLIALCISNVFFMLPWQFAQFVLLTQIASLFAVYWGY 
IDICKLRKIIYIHMISIALCFVLMFGNSMLLTSYYASSLVIIKG 
ILAMKPHFLKINVSELGLWVIQGCFWLFGTVILKYLTSKIFGIA 
NDAHIGNLLTSKFFSYKDFDTLLYTCAAEFDFMEKETPLRYTKT 
LLLPWLVGFVAIVRKIISDMWGVLAKQQTHVRKHQFDHGELVY 
HALQLIAYTALGILIMRLKLFLTPHMCVMASLICSRQLFGWLFC 
KYHPGA I VFA ILAAMS I QGSANLQTQWNI VGEFS ML PQEEL IE W 
IKYSTKPDAVFAGAMPTMASVKLSALRPIVNHPHYEDAGLRART 
KIVYSMYSRKAAEEVKRELIKLKVNYYILEESWCVRRSKPGC 
PE1WDVEDPANAGKTPLCNLLVKDSKPHFTTVFQNSVYKVLEW 
KE 



ILFRLVLLSLVYLLNSVATEERKPABVLIVEGQQYAWGTVLJjL 
IRIILEYCQGVDNIPSVTTDMLTRLSDLLKYFNSRSCQLVLGAG 
ALQWGLKTITTKNLALSSRCLQLIVHYIPVIRAHFEARLPPKQ 
YSMLRHFDHITKDYHDHIAEISAKLVAIMDSLFDKLLSKYEVKA 
PVPSACFRNICKQMTKMHEAIFDLLPEEQTQMLFLRINASYKLH 
LKKQLSHLNVINDGGPQNGWTADVAFYTGNLQALKGLKDLDLN 
MAEIWEQKR 

~ iSNSKLPGSIQHVYGAQHPPFDPLLHGTL 
LRSTAKMPTTPVKAKRVSTFQEFESNTSDAWDAGEDDDELLAMA 
AESLKSEVVMETANRVLRNHSQRQGRPTLOEGPGLQQKPRPEAE 
PPSPPSGDLRLVKSVSESHTSCPAESASDAAPLQRSQSLPHSAT 
VTLGGTSDPSTLSSSALSEREASRLDKFKQLLAGPNTDLEELRR 
LSWSGIPKPVRPMTWKLLSGYLPANVDRRPATLQRKQKEYFAFI 
EHYYDSRNDEVHQDT YRQIH I D I FRMS PEAL I LQPKVTEI FER I 
LFIWAIRHPA5GYVQGINDLVTPFFWFICEYIEAEEVDTVDVS 
GVPAEVLCWIEADTYWCMSKLLDGIQDNYTFAQPGIQMKVKMLE 
ELVSRIDEQVHRHLDQHEVRYLQFAFRWMWMLLMREVPLRCTIR 
LWDTYQSEPDGFSHFHLYVCAAFLVRWRKEILEEKDFQELLLFL 
QNLPTAHWDDEDISLLLAEAYRLKFAFADAPMHYKK 
ECCDQKMGSRNSSSAGSGSGDPSEGLPRRGAGLRRSEEEEEEDE 
DVDLAQVLAYLLRRGQVRLVQGGGAANLQFIQALLDSEEENDRA 
WDGRLGDRYNPPVDATPDTRELEFNEIKTQVELATGQLGLRRAA 
QKHSFPRMLHQRERGLCHRGSFSLGEQSRVISHFLPNDLGFTDS 
YSQKAFCGIYSKDGQIFMSACQDOTIRLYDCRYGRFRKFKSIKA 
RDVGWSVLDVAFTPDGNHFLYSSWSDYIHICNIYGEGDTHTALD 
LRPDERRFAVFSIAVSSDGREVLGGANDGCLYVFDREQNRRTLQ 
IESHEDDVNAVAFADISSQILFSGGDDAICKVWDRRTMREDDPK 
PVGALAGHQDGITFIDSKGDARYLISNSKDQTIKLWDIRRFSSR 
EGMEASRQAATQQNWDYRWQQVPKKAWRKLKLPGDSSLMTYRGH 
GVLHTLIRCRFSPIHSTGQQFIYSGCSTGKWVYDLLSGHIVKK 
LTNHKACVRDVSWHPFEEKIVSSSWDGNLRLWQYRQAEYFQDDM 
PESEECASAPAPVPQSSTPFSSPQ 
tL'CDQXMGSRNSSSAGSGSGDPSEGLPRRGAGLRRSEEEEEEDE 
DVDLAQVLAYLLRRGQVRLVQGGGAANLQFIQALLDSEEENDRA 
WDGRLGDRYNPPVDATPDTRELEFNEIKTQVELATGQLGLRRAA 
t.t 1 . RMLHQRERGLCHRGSFSLGEQSRVISHFLPNDLGFTDS 
\ 3C "A7CGIYSKDGQIFMSACQDQTIRLYDCRYGRFRKFKSIKA 
RDVCWSVLDVAFTPDGNHFLYSSWSDYIHICKIYGEGDTKTALD 
LRPDERRFAVFSIAVSSDGREVLGGANDGCLYVFDREQNRRTLQ 
IESHEDDVNAVAFADISSQILFSGGDDAICKVWDRRTMREDDPK 
PVGALAGHQDG I TFIDSKGDARYLISWS KDQT IKLWDIRRFS SR 
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nucleotide 
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sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalar.ine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 

S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








EGMEASRQAATQQNWDYRWQQVPKKAWRKLKLPGDSSLMTYRGH 
GVLHTLIRCRFSPIHSTGQQFI YSGCSTGKVWYDLLSGHIVKK 

PESEKCASAPAPVPQSSTPFSSPQ 


6282 


125 


90S 


RMAACRALKAVLVDLSGTLHIEDAAVPGAQEALKRLRGASVIIR 
FVTNTTKESKQDLLERLRKLEFDIS3DEIFTSLTAARSLLERKQ 
VRPMLLVDDRALPDFKGIQTSDPNAVVMGIAPEHFHYQILNQAF 
RLLLDGAPLIAIHKARYYKRKDGLALGPGPFVTALEYATDTKAT 
WGKPEKTFFLEALRGTGCEPEEAVMIGDDCRDDVGGAQDVGML 
GILVKTGKYRASDEEKINPPPYLTCESFPHAVDHILQHLL 


5283 


140 


1U43 


LSLFGIHVMNPFWSMSTSSVRKRSEGEERTLTGDVKTSPPRTAP 
KKQLPSIPKNALPITKPTSPAPAAQSTNGTHASYGPFYLEYSLL 
AEFTLWKQKLPGVYVQPSYRSALMWF3VIFIRHGLYQDGVFKF 
TVYIPDNYPDGDCPRLVFDIPVFHPLVDPTSGELDVKRAFAKWR 
RNHNHIWQVLMYARRVFYKIDTASPLNPEAAVLYEKDIQLFKSK 
WDSVKVCTARLFDQPKIEDPYAISFSPWNPSVHDEAREKMLTQ 
KKKPEEQHNKSVHVAGLSWKPGSVQPFSKEEKTVAT 


6284 


1 


2 8 79 


RSVIPGSTISSRVJPGLSRPRKMAAHEWDWFOREELIGOISDIRV 
QNLQVERENVQKRTFTRWINLHLEKCNPPLEVKDLFVDIQDGKI 
LFALLEVLSGRNLLIIEYKSSSHRIFRLMNIAKALKFLEDSNVKL 
VSIDAAEIADGNPSLVLGLIWNIILFFQIKELTGNLSRNSPSSS 
LAPGSGGTDSDSSFPPTPTAERSVAISVKDQRKAIKALLAWVQR 
KTRKYGVAVQDFAGSWRSGLAFLAVIKAIDPSLVDMKQALENST 
RENLEKAFS I AQDALHI PRLLEPED I MVDTPDEQS I MTYVAQFL 
ERFPELEAEDIFDSDKEVPIESTFVRIKETPSEQESKVFVLTEN 
GERTYTVNHETSHPPPSKVFVCDKPESMKEFRLDGVSSHALSDS 
STEFMHQIIDQVLQGGPGKTSDISEPSPESSILSSRKENGRSNS 
LPIKKTVHFEADTYKDPFCSKNLSLCFEGSPRVAKESLRQDGHV 
LAVEVAEEKEQKQESSK1PESSSDKVAGDIFLVEGTNNNSQSSS 
CNGALE S TARHDE ESHSLS P PGENTVMADS FQ I KWLMT VE ALE 
EGDYFEAI PLKAS KFNSDL IDFASTS QAFNKVPS PHETKPDEDA 

EGCPSAPEETPVDKKPEVHEKAKRKSTRPHYEEEGEDDDLQGVG 
EELSSSPPSSCVSLETLGSHSEEGLDFKP3PPLSICVSVIPHDLF 
YFPHYEVPLAAVLEAYVEDPEDLKNEEMDLEEPEGYMPDLDSRE 
EEADGSQSSSSSSVPGESLPSASDQVLYLSRGGVGTTPASEPA? 
LAPHEDHQQRETKENDPMDSHQSQESP1TL3NIANPLEENVTKES 
ISSKKKEKRKHVDHVESSLFVAPGSVQSSDDLEEDSSDYSIPSR 
TSHSDSSIYLRRHTHRSSESDHFSLCSVEERSRSG 


6285 


2157 


1331 


SCKTENLLEMWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHH 
IDPALPYISDTGTVAPEKCLFGAMLNIAAVLCIATIYVRYKQVH 
ALSPEEKVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVS 
GAVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCG 

vsalsmltcssvlhsgnfgtdleqklhwnpedkgyvlhmittaa 
c i -;3!' i :fsf?gffltyirdfqkislrveanlhgltlydtapcpin 
nertrllsrdi 


6286 




276 


- 3 - t-C3SANPYVSVGKSCVLLAMAQLQTRFYTDNKKYAVDDV 
PFSIPAASEIADLSNIINKLLKDKNEFHKHVEFDFLIKGQFLRM 
PLDICEMEMENISSEEWEIEYVEKYTAPQPEQCMFHDDWISSIK 
3ASE . . ILTGSYDKTSRIWSLEGKS IMTI VGHTDWKDVAWVKKD 
SLSCLLLSASMDQTILLWEWNVERNKVKALHCCRGHAGSVDSIA 
VEGSC-TKFCSGSWDKMLKIWSTVPTDEEDEMEESTNRPRKKQKT 
" rRTPIVTLSGHMEAVSSVLWSDAEEICSASWDHTIRVWD 
VESGSLKSTLTGNKVFNCISYSPLCKRLASGSTDRHIRLWDPRT 
XDGSLVSLSLTSHTGWVTSVKWSPTHEQQLISGSLDNIVKLWDT 
RSCKAPLYDLAAHEDKVLSVDWTDTGLLLSGGADNKLYSYRYSP 
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Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, l=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S^Serine, T=Threonine, V=Valine 
W=Tryptophan, Y=Tyrosine, X=Unkr.own, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








TTSHVGA 


6287 


27-8 


1482 


MQFFFNFQIGLRSTSGKEKYSGDAGFLGDALQLFLQCLALDEDF 

VMEESQSLNEPSPKQSEEIPEVTSEPVKGSIiNRAQSAQSINSTE 
MPAREDCLKRVSSEPVLSVQEKGVLLKRKLSLLEQDVIVNEDGR 
NKLKKQGETPNEVCMFSLAYGDIPEELIDVSDFECSLCMRLFFE 
PVTTPCGHSFCKNCLERCLDHAPYCPLCKESLKEYLADRRYCVT 
QLLEELIVKYLPDELSERKKIYDEETAELSHLTKNVPIFVCTMA 

GCMLQIRNVHFLPDGRSWDTVGGKRFRVLKRGMKDGYCTADIE 


S2S8 


1 


743 


VTLYPCRGLVGNLLLGASGMASGCKIGPSILNSDLANLGAECLR 
MLDSGADYLHLDVMDGHFVPNITFGHPWESLRKQLGQDPFFDM 
HMMVSKPEQWVKPMAVAGANQYTFHLEATENP3ALIKDIRENGM ■ 
KVGLAIKPGTSVEYLAPWANQIDMALVMTVEPGFGGQKFMEDMM 
PKVHWLRTQFPSLDIEVDGGVGPDTVKKCAEAGANMIVSGSAIM 
RSEDPRSVINLLRNVCSEAAQKRSLDR 


5289 


1. 


743 


rL '-P^LVGNLLLGASGMASGCKIGPSILNSDLANLGAECLR 
MLDSGADYLHLDVNDGHFVPNITFGHPWESI.RKQLGQDPFFDM 
KMMV5KPEQVJVKPMAVAGANQYTFHLEATENPGALIKDIRENGM 
KVGLAIICPGTSVEYIAPWANQIDMALVMTVEPGFGGQKFMEDKM 
PKVHWLRTQFPSLDIEVDGGVGPDTVHKCAEAGANMIVSGSAIM 
RSEDPRSVIKLLRNVCSEAAQKRSLDR 








lijGRWLLGVYETVAPTLACLFI - _ ISRYTRKA 
VPQSLELKGITKHALNHHPPPEKLEEISPTSDSHEKDTSSQSKS 
DITRESSFTSADTGNSLSAFPSYTGAGISTEGSSDFSWGYGELD 
QNATEKVQTMFTAIDELLYEQKLSVHTKELQEECQQWTASFPHL 
RILGRQIITPSEGYRLYPRSPSAVSASYETTLSQERDSTIFGIR 
GKKLHFSSSYAHKASSIAKSSSFCSMERDEEDSIIVSEGIIEEY 
LAFDHIDIEEGFHGKKSEAATEKQKLGYPPIAPFYCMKEDVLAY 
VFDSVWCKWSCMEQLTRSHWEGFASDDESNVAVTRPDSESSCV 
LSELHPLVLPRVPQSKVLYITSWPMSLCQASRHQPNVNDLLVHG 

TSSLSYTVQSTRRRNPPPRTLHPISTSHSCAETPRSVEEILRGA 
RVPVAPDSLSSPSPTPLSRNNLLPPIGTAEVEHVSTVGPQRQMK 
PHGDSSRAQSAWDEPNYQQPQERLLLPDFFPRPNTTQSFLLDT 
QYRRSCAVEYPHQARPGRGSAGPQLHGSTKSQSGGRPVSRTRQG 
P 


6291 


1732 




PFAKYNRLGQLSCALCNTPVKSELLWQTHVLGKQHREKVAELKG 
AKEASQGSSASSAPQSVKRKAPDADDQDVKRAKATLVPQVQPST 
SAWTTNFDKIGKEFIRATPSKPSGLSLLPDYEDEEEEEEEEEGD 
G2RKRGDASKPLSDAQGKEHSVSSSREVTSSVLPNDFFSTNPPK 
APIIPHSGSIEKAEIHEKWERRENTAEALPEGFFDDPEVDARV 
RKVDAPKDQMDKEWDEFQKAMRQVNTISEAIVAEEDSEGRLDRQ 
IGEIDEQIECYRRVEKLRNRQDEIKNKLKEILTIKELQKKEEEN 
ADSDDEGELQDLLSQDWRVKGALiL 


6292 


1835 


1142 


' 1LM7APWTRFYSNSCCLCCHVRTGTILLGVWYLI INAW 
LLILLSALADPDQYNFSSSELGGDFEFMDDANMCIAIAISLLMI 
LICAMATYGAYKQRAAWIIPFFCYQIFDFALNMLVAITVLIYPN 
SIQEY IRQLPPNFP YRDDVMS VNPTCLVLI ILLFI S 1 1 LTFKGY 
LIS CVWNC YRY INGRNS SDVLVYVTSNDTT VLLP P YDDATVNGA 
AKEPPPPYVSA 


6293 


2382 


1035 


FWCTLGTVDVHPIGWCAINSKILVPPRTIHAKFTDWKGYLMKRL 
VGSRTLPVDFHIKMVESMKYPFRQGMRLEWDKSQVSRTRMAW 
DTVIGGRLRLLYEDGDSDDDFWCHMWSPLIHPVGWSRRVGHGIK | 
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Ammo acxd segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 

L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








MSERRSDMAHHPTFRKIYCDAVPYLFKKVRAVYTEGGWFEEGMK 
LEAIDPLNLGNICVATVCKVLLDGYLMICVDGGPSTDGLDWFCY 
HASSHAIFPATFCQKNDIELTPPKGYEAQTFNWENYLSKTKSKA 
APSRLFNMDCPNHGFKVGMKLEAVDLMEPRLICVATVKRWHRL 
LSIHFDGWDSEYDQWVDCESPDIYPVGWCELTGYQLQPPVAAEP 

PQGARKISSEPVPGEI I AVRVKEEHLDVAS PDKASS PELPVS VE 
NIKQETDD 


6294 


354 


1814 


AQLTTRGRTVAGGVRW I PS PF PDLELYS CCLGTDRGFPELSHHC 
KWVIATASDYDMAEITNIRPSFDVSPWAGLIGASYLWCVSVT 
VFVWSCCHQQAEKKHKNPPYKFIHMLKG IS I YPETLSNKKKI I K 
VRRDKDGPGREGGRRNLLVDAAEAGLLSRDKDPRGPSSGSCIDQ 
LPIKMDYGEELRSPITSLTPGESKTTSPSSPEEDVMLGSLTFSV 
DYNFPKKALWTIQEAHGLPVMDDQTQGSDPYIKMTILPDKRHR 
VKTRVLRKTLDPVFDETFTFYGIPYSQLQDLVLHFLVLSFDRFS 
RDDVIGEVMVPLiAGVDPSTGKVQLTRDIIKRMIQKCISRGELQV 
SLSYQPVAQRMTVWLKARHLQKMDIAGLSGKTPYVKVNVYYGRK 
RIAKKKTHVKKCTLNPIFNESFIYDIFTDLLPDISIEFLVIDFD 
RTTKNEWGRL I LGAHSVTAS GAEHWKEVCES PRKPVAKWHSLS 
EY 


6295 


2795 


617 


VSSALLTGATSGSDAAKSEGASASPLSCTNAVAMDRPDEGPPAK 
TRELSSSES PQRDPP P PP P PPPLLRLPLPPPQQRPRLQEETEAA 

TIESGYGEAPPPTESLEALPTPEASGGSLEIDFQWQSSSFGGE 
GALETCSAVGWAPQRLVDPKSKEEAIIIVEDEDEDERESMRSSR 
RRRRRRRRKQRKVKRESRERNAERMESILQALEDIQLDLEAVNI 

I S I L INRRDED I FR YLTNLQVQDLRH ISMG YKMKLYFQTNP YFT 
NM V IVKE FQRNRS GRLVSHSTPI RWHRGQE PQARRHGNQDASHS 
FFSWFSNHSLPEADRIAE I IKNDLWVNPLRYYLR3R3SRI KRKK 
QEMKKRKTRGRCEWIMEDAPDYYAVEDIFSEISDIDETIHDIK 
ISDFMETTDYFETTDNEITDINENICDSENPDHNEVPNNETTDN 
NESADDHETTDNNESADD1TNENPEDWNKNTDDWEENPNKNENTY 
GNNFFKGGFWGSHGWNQDSSDSDNEADEASDDEDHDGNEGDNEG 
SDDDGNEGDNEGSDDDDRDIEYYEKVI3DFDKDQADYEDVIEII 
SDESVEEEC IEEG IQQDEDI YEEGNYEEEGSEDWEEGEDSDDS 
DLEDVLQVPNGWANPGKRGKTG 


6296 


727 


1193 


RHCGCDAQGACDSLPPTGTSSPVTAKJMAIPEARCCVWLLDGTTV 
EAVRPARERLARKELRQKRMQQFSRDSAYSSNKDSTCLLTERDT 
. PSPFSGTISFGSFSDSGIFPLG3QCCLGFQQFS1SGK 
KWALIHKRVRLSVFGARWGRIYFGK 


6297 




922 


QRAAAASPSSCGPRGAEYGALMAMEGYWRFLALLGSALLVGFLS 
VI FALVWVLHYREGLGWDGSALE FNWHPVLMVTGFVFI QGIAI I 
VYRLPWTWKCSKLLMKSIHAGLNAVAAILAIISWAVFENHNVN 
N] H iSLHSWVGLIAVICYLLQLLSGFSVFLLPWAPLSLRAFL 
MPIHVYSGIVIFGTVIATALMGLTEKLIFSLRDPAYSTFPPEGV 
FVTJTLGLL ILVFGAL I FWIVTRPQWKRPKEPNSTILHPNGGTEQ 
[PAYSGNNMDKSDSELNNEVAARKRNLALDEAGQRSTM 


6298 


3 


985 


S VPLRRLSLS GTLQGAGTT TKMAVARLAAVAAWVP CRS WGWAAV 
PFGPHRGLSVLLARIPQRAPRWLPACRQKTSLSFLNRPDLPNLA 
YKKLKGKSPGIIFIPGYLSYMNGTKALAIEEFCKSLGHACIRFD 
YSGVGSSDGNSEESTLGKWRKDVLSIIDDLADGPQ1LVGSSLGG 
|..,T,Mr,!TA/;.iARPEKVVALIGVATAADTLVTKFNQLPVELKKEVEM 
MPS K YS EEGVYNVQYS F I KEAEHHCLLHS P I P VNCP IRL 
T HGM nDlVPWHTSMQVADRVLSTDVDVILRKHSDHRMREKADI 
QLLVYTIDDLIDKLSTIVN 
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ID 
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to first 

residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysneine, D=Aspartic Acid, E= 
Glutamic Acid, F=?henylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6299 


512 




E -IMPNVTISLSLPTNGSPLQDILVHPCVTSLDSAILTSS 
SIDAMDDSAFSGPYKFPFTPPLESFNLCFYTSQVPVPPILGFYQ 
MKEEEVQLRNNH 


5300 


121 




AAPS CWSQRG VPAAC-T P S S PRLLVS RAAAP S AG ? WGAWRQGARA 
AQSPFSIPNSSSVPYGSQDSVHSSPEDGGGGRDRPVGGSPGGPR 
LVIGSLPAHLSPHMFGGFKCPVCSKFVSSDEMDLHLVMCLTKPR 
ITYNEDVLSKDAGECAICLEELQQGDTIARLPCLCIYHKGCIDE 
WFEVNRSCPEHPSD 


6301 


516 


284 


GKFVPVNWEPPQPLPFPKYLRCYRCLLETKELGCLLGSDICLTP 
AGSSCITLHKKNSSGSDVMVSDCRSKEQMSDCSNTRTSPVSGFW 
IFSQYCFLDFCNDPQNRGLYTP 


6302 


490 


745 


IFGFLHLFHMEHSFLLVCALFAHVFFSSSCGSSVALHSDPCLLS 
PVLLIJCLPGDLRPLDELYAQKLKYKAISEELDHALNDMTSL 


6303 


2 


1961 


, I E GGGLLWQSWQEKHPGQALS5EPWNFPDTKEKWEQ-YSQL 
YWYYLEQFQYWEAQGWTFDASQSCDTDTYTSKTEADDKNDEKCM 
KVDLVSFLSSPIMGDNDSSGTSEKDHSEILDGISNIKXKSEEVT 
QSQLDSCTSKDGHQQLSEVSSKRECPASGQSEPRNGGTNEESNS 

sgntntdppaedsqkssgantskdrphasgtdgdeseedppehk 
psklkrsheld idenpasdfddsgsllgfkygsgqkyggi pnfs 
hrqvryijEknvklkskyldmrrqikmknkhifftkesbkpffkk 
skilskvekfltwvnkpmdee sqe ii t cdseeq 
dmsvkkgddlletnnpepekcqsvssageletekyerdsllatv 
pdeqdcvtqevpdsrqaeteaevkkkknkkknkkvnglppeiaa 
vpelakywacryrlfsrfddgikldregwfsvtpekiaehiagr 
vsqs f kcd vwdafcgjvggnt i qfaltgmr vi ai d i d p vk i ala 
rnnaevygiadkieficgdflllasflkadwflsppwggpdya 
taetfdirtmmspdgfeifrlskkitnnivyflprnadidqvas 
lagpggqveieqnflnnklktitayfgdlirrpaset 


63 01 


1 


143 8 


hrarvdrsrespggdlrhpgrvrrditi.sghprlstqhvvi.JjRe 
devgdpgtkdlghpqhgspiqetqsewtlvsplpgsdmaalpa 
wratsgltlwphtaegrdllgaenraltggqqaedptlasgayq 

WPGSVEKLQSSVWCDAETLLSSSRTGGQAPPWLTDHDVQMLRLL 
AQGEWDKARVPAHGQVLQVGFSTEAALQDLSSPRLSQLCSQGIj 
CGLIKRPGDLPEVLSFHVDRVLGLRRSLPAVARRFHSPLLPYRY 
TDGGARPVIWWAPDVQHLSDPDEDQNSLALGWLQYQALLAHSCN 
WP3QAPCPGIHHTEWARLALFDFLLQVHDRLDRYCCGFEPEPSD 
PCVEERLREKCRNPAELRLVHILVRSSDPSHLVYIDMAGNLQHP 
EDKLNFRLLE3IDGFPESAVKVLASGCLQNMLLKSLQMDPVFWE 
SQGGAQGLKQVLQTLEQRGQVLLGHIQKHNLTLFRDEDP 


6305 


99 


420 


■1STYRPRPRRSVPPPELIGFMLEPGDEEPQQEEPPTES 
RDPAPGQEREEDQGAAETQVPDLEADLQELSQSKTGDECGDGPD 
VQGKILTKSEQFKMPEGR 


6306 




1874 


PTRPSKVKVPHTFLIHSYTRPTVCQACKKLLXGLFRQGLQCKDC 
KFNCHKRCATRVPNDCLGEALINGDVPMEEATDFSEADKSALMD 
ESEDSGVIPGSHSENALHASEEEEGEGGKAQSSLGYIPLMRWQ 
E V RHTTRKS STTLREGWWHYSNKDTLRKRHYWRLDCKC I TLFQ 
NNTTNR Y YKE IPLSEILTVE S AQNFSLVP PGTNPHC FE I VTANA 
TYFVGEMPGGTPCCPSGQGAEAARGWETAIRQALMPVILQDAPS 
APGHAPHRQASLS IS VSNSQ I QENVDIATVYQ I FPDEVLGSGQF 
GWYGGKHRKTGRDVAVKVIDKLRFPTKQESQLRNEVAILQSLR 
HPGIVNLECMFETPEKVFWMEKLHGDMLEMILSSEKGRLPERL 
TKFLI TQ I LVALRHLHFKNIVHCDLKPENVLLASADP FPQVKLC 
D FGFAR IIGEKS FRRS WGTPAYLAPEVLLNQGYNRS LDMWSVG 
VIMYVSLSGTFPFNEDEDINDQIQNAAFMYPASPWSEISAGAID 
LINNLLQVKMRKRYSVDKSLSHPWLQEYQTWLDLRELEGKMGER 
YITHESDDARWEQFAAEHPLPGSGLPTDRDLGGACPPQDHDMQG 
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SEQ 
ID 
WO: 


beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted er.d 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Hi3tidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LAERISVL 


S307 


213S 


589 


CFLLPRGRDPEPPEAGAAAPCAPGAPDMSFRKVVRQSKFRHVFG 
QPVKNDQCYEDIRVSRVTWDSTFCAVNPKFLAVTVEASGGGAFL 
VLPLSKTGRIDKAYPTVCGHTGPVLDIDWCPHNDEVIASGSEDC 
TVMTOQIPEMGLTSPLTEPVWLEGHTKRVGI1AWHPTARNVLL 
SAGCDNWLIWWGTAEELYRLDSLHPDLIYNVSWNHNGSLFCS 
ACKDKSVRIIDPRRGTLVAEREKAHEGARPMRAIFLADGKVFTT 
GFSRMSERQLALWDPENLEEPMALQELDSSNGALLPFYDPDTSV 
VYVCGKGDS SIRYFEI TEEPPY I HFLNTFTS KE PQRGMGSMPKR 
GLEVSKCEIARFYKLHERKCEPIVMTVPRKSDLFQDDLYPDTAG 
PEAALEAEEWVSGRDADPILISLREAYVPSKQRDLKISRRNVLS 
DSRPAMAPGSSHLGAPASTTTAADATPSGSLARAGEAGKLEEW. 
QELRALRALVKEQGDRICRLEEQLGRMENGDA 


S308 


2 




GRPTRPEKMLLSLVLHTYSMRYLLPSVVLLGTAPTYVLAWGVWR 
LLSAFLPARFYQALDDRLYCVYQSMVLiFFFENYTGVQIIiLYGDL 
PKNKENIIYLANHQSTVDWIVADILAIRQNALGHVRYVLKEGLK 
WLPLYGWYFAQHGGIYVKRSAKFNEKEMRNKLQSYVDAGTPMYL 
VIFPEGTRYNPEQTKVLSASQAFAAQRGLAVLKHVLTPRIKATH 
VAFDCMKNYLDAIYDVTWYEGKDDGGQRRESPTMTEFLCKECP 
KIHIHIDRIDKKDVPEEQEHMRRWLHERFEIKDKMLIEFYESPD 
PERRKRFPGKSOTSKr.SIKKTLPSMLILSGLTAGMMTDAGRKL 
fVTlKA 


6309 


22'J 


563 


LVAI 3 I IL5VDMEWKENGSVGVKNSMENGRPPDPADWA 
VMDVVNYFRTVGFEEQAS AFQEQE I DGKS LLLMTRNDVLTGLQL 
KLGPALKIYEYHVKPLQTKHLKMNSS 


6310 


36 


979 


GPRGWKFLILSSVNCETLRIGKAWPQSSGQE] ~~ 

AQRGSLAELNVAAAGLWADCDQPLYDCPMCGLICTNYHILQEHV 

DLHLEENSFQQGMDRVQCSGDLQLAHQLQQEEDRKRRSEESRQE 

IEEFQKLQRQYGLDNSGGYKQQQLRNMEIEVNRGRMPPSEFHRR 

KADMMESLALGFDDGXTKTSGIIEALHRYYQNAATDVRRVWLSS 

WDHFHSSLGDKGWGCGYRNFQMLLSSLLQNDAYNDCLKGMLIP 

CIPKIQSMIEDAWKEGFDPQGASQLIIRLQGTKAWIGACEVYIL 

LTSLRV 


6311 




675 


PVWWWSCEGPRLAAAARTGHGVGRRARLACLGEPRVKAAVMLTL 
ASKLKRDDGLKGSRTAATASDSTRRVSVRDKLLVKEVAELEANL 
PCTCKVHFPDPNKLHCFQLTVTPDEGYYQ.GGKFQFE7EVPDAYN 
MVPPKVKCLTKIWHPNITETGE I CLSLLREHS IDGTGWAPTRTL 
KDWWGLNSLFTDLLNFDDPLNIEAAEHHLRDKEDFRWKVDDYI 
KRYAR 


6312 


213 


1400 


GDELVKREAGMKMLPGVGVFGTGSSARVLVPLLRAEGFTVEALW 
GKTEEEAKQLAEEMNIAFYTSRTDDILLHQDVDLVCISIPPPLT 
RQISVKALGIGKNWCEKAATEVDAFRMVTASRYYPQLMSLVGN 
VLRFLPAFVRMKQLISEHYVGAVMICDARIYSGSLLSPSYGWIC 
DELMG3GGLHTMGTYIVDLLTHLTGRRAEKVHGLLKTFVRQNAA 
IRGIRHVTSDDFCFFQMLMGGGVCSTVTLNFNMPGAFVHEVMW 
GSAGRLVARGADLYGQKN3ATQEELLLRDSLAVGAGLPEQGPQD 
VPLLYLKGMVYMVQALRQS FQGQGDRRTWDRTPVSMAAS FEDGL 
YMQSWDAIKRSSRSGEWEAVEVLTEEPDTNQNLCEALQRNNL 


6313 


2 


2071 


i. RSC P ^RIAFLPSPFSPACVHRSPLSFHC-CWFYFWVFMPLGVL 
FHRRRAHGCTLSCSSFVEQPTAMEAEETKECLQEFPEHHKMILD 
RLNEQREQDRFTDITLIVDGHHFKAHKAVLAACSKFFYKFFQEF 
TQEPLVEIEGVSKMAFRHLIEFTYTAKLMIQGEEEANDVWKAAE 
FLQMLEAIKALEVRNKENSAPLEENTTGKKEAKKRKIAETSNVI 
TESLPSAESEPVEIEVEIAEGTIEVEDEGIETLEEVASAKQSVK 
YIQSTGSSDDSALALLADITSKYRQGDRKGQIKEDGCPSDPTSK 
QVEGIEIVELQLSHVKDLFHCEKCNRSFKLFYHFKEHMKSHSTE 
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Predicted 


Predicted end 


Amino acid segment containing signal peptide 








(A=Alanme, C=Cysteme, D=Aspartic Acid, E= 


: 


location S 




Glutamic Acid, F=Phenylalanine, G=Glycine, 






corresponding 






corresponding 


to first 


L=Leucine, M=Methionine, N=Asparagine , 




to first 


amino acid 


P=Proline, Q=Glutamine, R=Arginine, 




amino acid 


residue of 


S=Serine, T=Threonine, V=Valine, 








W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 






sequence 


Codon, /=possible nucleotide deletion, 




sequence 




\=possible nucleotide insertion) 








SFKCEICNKRYLRESAWKQHLNCYHLEEGGVSKKQRTGKKIHVC 








QYCEKQFDHFGHFKEHLRKHTGEKPFECPNCHEEFARNSTLKCH 








LTACQTGVGAKKGRKKLYECQVCNSVFNSWDQFKDHLVIHTGDK 








PNHCTLCDLWFMQGNELRRHLSDAHHISERLVTEEVLSVETRVQ 








TEPVTSMTIIEQVGKVHVLPLLQVQVDSAQVTVEQVHPDLLQDS 








QVHDSHMSELP EQVQVS YLEVGRI QTEEGTEVHVEELHVERVNQ 








MPVEVQTELLEADLDHVTPEIMNQEERESSQADAAEAAREDHED 








AEDLETKPTVDSEAEKAENEDRTALPVLE 


6314 


2 


2071 


QRSGAARLAFLPSPFSPACVHRSPLSFHGCWFYFVVVFMPLGVL 
FHRRRAHGCTLSCSSFVEQPTAKEAi^Ei'i-lECi^C-jrLr PEHHKMILD 
RLNEQREQDRFTDITLIVDGHHFKAHKAVLAACSKFFYKFFQEF 
TQEPLVEIEGVSKMAFRHLIEFTYTAKLMIQGEEEANDVWKAAE 
FLQMLEAIKALEVRNKENSAPLEENTTGKNEAKKRKIAETSNVI 
TESLPSAESEPVEIEVEIAEGTIEVEDEGIETLEEVASAKQSVK 
YIQSTGSSDDSALALLADITSKYRQGDRKGQIKEDGCPSDPTSK 
QVEGIEIVELQLSHVKDLFHCEKCNRSFKLFYHFKEHMKSHSTE 
SFKCEICNKRYLRESAWKQHLNCYHLEEGGVSKKQRTGKKIHVC 
QYCEKQFDHFGHFK~HT,RKHT3EKPFECPNCHERFARNSTLKCH 
LTACQTGVGAKKGRKKLYECQVCNSVFNSWDQFKDHLVIHTGDK 
PNHCTLCDLWFMQGNELRRHLSDAHNISERLVTEEVLSVETRVQ 
TEPVTSMTIIEQVGKVHVLPLLQVQVDSAQVTVEQVHPDLLQDS 
QVHDSHMSELPEQVQVSYLEVGRIQTEEGTEVHVEELHVERVNQ 
MPVEVQTELLEADLDHVTPF.IMNQEERESSQADAAEAAREDHED 


6315 


1 


1015 


L,GLAVWWTTLVLISYCPTA'rEEAPYWTYLLCALGLPIYQSLDA 
IDGKQARRTNS CS PLGELFDHGCDSLS TVFMAVGAS IAARLGTY 
PDWFFSCSFIGMFVFYCAHWQTYVSGMLRFGKVDVT3IQIALVI 
VFVLSAFGGATMWDYTIPILEIKLKILPVLGFLGGVIFSCSNYF 
HVILHGGVGKNGSTIAGTSVLSPGLHIGLIIILAIMIYKKSATD 
VFEKHPCLYILMFGCVFAKVSQKLV\ r AHMTKSELYLQDTVFLGP 
GLLFLDQYFNNFIDEYWLWMAMVISSFDMVIYFSALCLQISRH 
LHLNI FKTACHQAPEQVQVLSSKSHQMNMD 


6316 


1503 


792 


VSAGAGTGIMGGTTSXRRVTFEADENEHITWKGIRLSENVIDR 
MKESSPSGSKSQRYSGAYGASVSDEELKRRVAEELALEQAKKES 
EDQKRLKQAKELDRERAAANEQLTRAILRERICSEEERAKAKHL 
ARQLEEKDRVLKKQDAFYKEQLARLEERSSEFYRVTTEQYQKAA 
EEVEAKFKRYESHPVCADLQAKILQCYRENTHCTLKCSALATQY 


6317 


102 


839 


PEAQ J'.- AVLAREKGHLPTMRHEAPMQMASAQDARYGQKDSSDQN 
FDYMFKLLI IGNS S VGKTS FLFRYADDS FTSAFVS TVG I DFKVK 
TVFKNEKRIKLQIWDTAGQERYRTITTAYYRGAMGFILMYDITN 
EESFNAVQDWSTQIKTYSWDNAQVILVGNKCDMEDERVISTERG 
QHLGEQLGFEFFETSAKDNIOTKQTFERLVDIICDKMSESLETD 
PAITAAKQNTRLKETPPPPQPNCAC 


6318 


1765 


733 


PWHPLRTLPLHHPHPRPPRAEGREGADSMSHLPGLELRREAPPL 
LGPLLSPFPLPAGSWHRQMLRSSLRFPITNSAGAPCKAAGRMWI 
LAPVRRDRVLAELPQCLRKEAALHGHKDFHPRVTCACQEHRTGT 
VGFKISKVIWGDLSVGKTCLIKRFCKDTFDKNYKATIGVDFEM 
ERFEVLGIPFSLQLWDTAGQERFKCIASTYYRGAQAIIIVFNLKT 
DVASLEHTKQWLADALKENDPSSVLLFLVGSKKDLSTPAQYALM 
EKDALQVAQEMKAEYWAVSSLTGENVREFFFRVAALTFEANVLA 
ELEKSGARRIGDWRINSDDSNLYLTASKKKPTCCP 


6319 


88 


717 


AATMRLNQNTLLLGKKWLVPYTSEHVPSRYHEWMKSEELQRLT 
ASEPLTLEQEYAMQCSWQEDADKCTFIVLDAEKWQAQPGATEES 
CMVGDWLFLTDLEDLTLGEIEVMIAEPSCRGKGLGTEAVLAML 
SYGVTTLGLTKFEAKIGQGNEPSIRMFQKLHFEQVATSSVFCEV 
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SEQ 
NO: 


Predicted 
beginning 
nucleotide 
location 

residue of 
sequence 


Predicted end 
nucleotide 

corresponding 

residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 

P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, r=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TLRLTVSESEHQWLLEQTSHVEEKPYRDGSAEPC 


S320 


90 


1111 


RPRTGREKVAMAAVDSFYLLYREIARSCNCYM3ALALVGAWYTA 
RKSITVICDFYSLIRLHFIPRLGSRADLIXQYGRWAVVSGATDG 

VADFSSGREIYLPIREALKDKDVGILVNNVGVFYPYPQYFTQLS 
EDKLWD 1 1 NVNI AAASLMVHWLPGMVERKKGAI VTI S SGSCCK 
PTPQLAAFSASKAYLDHFSRALQYEYASKGIFVQSLIPFYVATS 
MTAP S NFLHRCS WLVP S P KVYAHHAVSTLG I S KRTT GYWSHS I Q 
FLFAQYMPEWLWVWGANILNRSLRKEALSCTA 


6321 


141B 


341 


HRKAALGALI4AGRLLGKALAAVSLSLALA.SVTIRSSRCRGIQAF 
RNSFSSSWFHLNTKTTOSG3NGSKENSHNKARTSPYPGSKVERSQ 
VPNEKVGWLVEWQDYKPVEYTAVSVLAGPRWADPQISESNFSPK 
FNEKDGHVERKSKNGLYEIENGRPRNPAC-RTGLVGRGLLGRWGP 
NHAADPIITRV7KRDSSGNKIMHPVSGKHILQFVAIKRKDCGEWA 
IPGGMVDPGEKISATLKR3FGEEALNSLCKTSAEKREIEEKLHK 
LFSQDHLVIYKGYVDDPRNTDNAWMETEAWYHDETGEIMDNLM 
LEAGDDAGKVKWVDINDKLKLYASHSQFIKLVAEKRDAHWSEDS 
EADCHAL 


S322 


2047 




NQEILKNVESSRTVQPHFLEFLLSLGWSVDVGRIIPGWTGHVSTS 
WSINCCDDGEGSQQEEVISSEDIGASIFNGQKKVLYYADALTEI 
AFWPSPVESLTDSLESNrSDQDSDSNMDLMPGILKQPSLTLEL 
FPNHTDNLNSSQRLSPSSRMRKLPQGRPVPPLGPETRVSWWVE 
RYDDIENFPLSELMTEISTGVETTANSSTSLRSTTLEKEVPVIF 
IHPLNTGLFRIKIQGATGKFNMVIPLVDGMIVSRRALGFLVRQT 
VINICRRKRLESDSYSPPHVRRKQKITDIVNKYRNKQLEPEFYT 
SLFQEVGLKNCSS 


5323 




656 


PASTTDGAQEARVPLDGAFWIPRPPAGSPKGCFACVSKPPALQA 
PAAPAPEPSASPPMAPTLFPMESKSSKTDSVRAAGAPPACKHLA 
EKKTMTNETTVIEVYPDTTEVNDYYLWSIFNFVY1NFCCLGFIA 
LAYSLKVRDKKLLNDLNGAVEDAKTDRLINITRSGLAASCIMLW 
MALSVIATHRGLRSSASILVAEPHDWWTERPQVTFRERCPAL 


6324 


1 


2061 


EGAGMRRCPCRGSLNEAEAGALPAAARMGLEAPRGGRRRQPGQO. 
RPGPGAGAPAGRPEGGGPWARTEGSSLHSEPERAGLGPAPGTES 
PQAEFWTDGQTEPAAAGLGVETERPKQKTEPDRSSLRTHLEWSW 
SELGTTCLWTETGTDGLWTDPHRSDLQFQPEEASPWTQPGVHGP 
WTELETHGSQTQPERVKSWADNLWTHQNSSSLQTHPEGACPSKE 
PSADGSWKELYTDGSRTQQDIEGPWTEPYTDGSOXKQDTEAARK 
I- 1 Xi^IQQDTDGSWTQPSTDGSQTAPGTDCLLGEPEDGPLE 

RVEGGSGGFS SAS S FDESEDD WAGGGGASDPEDRSGSKPWKKL 
KTVLKYS PFWS FRKHYPWVQLSGHAGNFQAGEDGR I LKRFCQC 
EQRSLEQLMKDPLRPFVPAYYGMVLQDGQTFNQMEDLLADFEGP 
SIMDCKMGSRTYLEEELVKARERPRPRKDMYEKMVAVDPGAPTP 
EEHAQGAVTKPRYMQWRETMSSTSTLGFRIEGIKKADGTCNTNF 
KKTQALEQVTKVLEDFVDGDHVILQKYVACLEELREALEISPFF 
KTHEWGSSLLFVHDHTGLAKVWMIDFGKTVALPDHQTLSHRLP 
WAEGNREDGYLWGLDNMICI.LQGLAQS 


6325 


155 


944 


GLRDPFRRKRRLKPQVKMSNYVNDMWPGSPQEKDSPSTSRSGGS 
SRLSSRSRSRSFSRSSRSHSRVSSRFSSRSRRSKSRSRSRRRHQ 
RKYRRYSRSYSRSRSRSRSRRYRERRYGFTRRYYRSPSRYRSRS 
RSRSRSRGRSYCGRAYAIARGQRYYGFGRTVYPEEHSRWRDRSR 
TRSRSRTPFRLSEKDRMELLEIAKTNAAKALGTTNIDLPASLRT 
VPSAKETSRGIGVSSNGAKPEVSILGLSEQNFQKANCQI 


6326 




680 


GEPSPATQQKPSATGAGVLHQHFSSGHIYVLiMGLLPPPrtTISFT 
VQTTLQ P PGGLPAAP VSGRMAFE PVGRDLARRMVPRAGKRTQTL 
GARRVAAQGARPLPEDRRPKSGERLKVTVAPCWEFVLPSVSIiTA 
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Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Fhenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=I,eucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
_\=possible nucleotide insertion) 



QEASSGVP 



SLAHLAPAGGSWMPTQQPAAPSTRAPKPSRSLSGSLCALFSDA 
DSGSGMKAELPPGPGAVGREMTKSEKLQLRKEKKQQKKKRKEEK 
GAEPETGSAVSAAQCQGPTRELPESGIQLGTPREKVPAGRSKAE 
LRAERRAKQEAERALKQARKGEQGGPPPKASPSTAGETPSGVKR 
LPEYPQVDDLLLRRLVKKPERQQVPTRKDYGSKVSLFSHLPQYS 
RQNSLTQFMSIPSSVIHPAMVRLGLQYSQGLVRGSNARCIALLR 
ALQQVIQDYTTPPNEELSRDLVNKLKPYMSFLTQCRPLSASMHN 
AIKFLNKEITSVGSSKREEEAKSELRAAIDRYVQEKIVLAAQAI 
SRFAYQKISNGDVILVYGCSSLVSRILQEAWTEGRRFRWWDS 
RPWLEGRHTLRSLVHAGVPASYLLIPAASYVLPEVSTEEKDSKV 
GGEKV 



HASAEVTTAAARGLGAMEEEMHTDAKIRAENGTGSSPRGPGCSL 
RHFACEQNLLSRPDGSASFLQGDTSVLAGVYGPAEVKVSKEIFN 
KATLEVILRPKIGLPGVAEKSRERLIRNTCEAWLGTLHPRTSI 
TWLQWSDAGSLLACCLNAACMALVDAGVPMRALFCGVACALD 
SDGTLVLDPTSKQEKEARAVLTFALDSVERKLLMSSTKGLYSDT 
ELQQCLAAAQAAS QHVFRFYRESLQRRYSKS 



^SaVAAGGGTRSAMAEGSGEWTVSATGAANGLNNGAGGTSATT 
SNPLSRKLHKILETRLDNDKEMLEALKALSTFFVENSLRTRRNL 
RGDIERKSLAINEEFVS IFKEVKEELES I SEDVQAMSKCCQDMT 
SR1QAAKEQTQDLIVKTTKLQSESQKLEIRAQVADAFLSKFQLT 
SDEMSLLRGTREGPITEDFFKALGRVKQIHNDVKVLLRTNQQTA 
GLEIMEQMALLQETAYERLYRWAQSECRTLTQESCDVSPVLTQA 
MEALQDRPVLYKYTLDEFGTARRSTWRGFIDALTRGGPGGTPR 
PIEMHSHDPLRYVGDMLAWLHQATASEKEHLEALLKHVTTQGVE 
ENIQEWGHITEGVCRPLKVRIEQVIVAEPGAVLLYKISNLLKF 
YHHTISGIVGNSATALLTTIEEMHLLSKKIFFNSLSLHASKLMD 
KVELPPPDLGPSSALNQTLMLLREVLASHDSSWPLDARQADFV 
QVLSCVLDPLLQMCTVSASNLGTADMATFMVNSLYMMKTTLALF 
EFTDRRLEMLQFQIEAHLDTLINEQASWLTRVGLSYIYNTVQQ 
HKPEQGSLANMPNLDSVTLKAAMVQFDRYLSAPDNLLIPQLNFL 
LSATVKEQIVKQSTELVCRAYGEVYAAVMNPINEYKDPENILHR 
SPQQVQTLLS 



FFYYTFYENKTFSRKMVAEKETLSLNKCPDKMPKRTKLLAQQPL 
PVHQPHSLVSEGFTVKAMMKNSWRGPPAAGAFKERPTKPTAFR 
KFYERGDFPIALEHDSKGNKIAWKVEIEKLDYHHYLPLFFD3LC 
EMTFPYEFFAKQGIHDMLEHGGNKILPVLPQLIIPIKNALNLRN 
RQVI CVTLKVLQHLWSAEMVGKALVP YYRQ I LP VLNI FKNMNV 
NSGDGIDYSQQKRENIGDLIQETLEAFERYGGENAF1NIKYWP 
TYESCLLN 



QQGQRVRTRGRRACASATPLEGCVDLSYPRTHAALLKVAQMVTL 
LIAFICVRSSLWTNYSAYSYFEWTICDLIMILAFYLVHLFRFY 
RVLTCISWPLSELLHYLIGTLLLLIASIVAASKSYNQSGLVAGA 
I FGFMATFLCMASIWLS YKISCVTQSTDAAV 



.VTESNKFDLVSFIPLLRERIYSNHQYARQFIISWILVLESVPDI 
NLLDYLPE ILDGLFQ ILGDNGKE 1 RKMCEWLGE FLKE IKKNPS 
SVKFAEMANILVIHCQTTDDLIQLTAMCWMREFIQLAGRVMLPY 
SSGILTAVLPCLAYDDRKKSIKEVANVCNQSLMKLVTPEDDELD 
ELRPGQRQAEPTPDDALPKQEGTASGEWTPSLHLTSCRGPREPD 
VIGVALGPHLSNQDYFMYVTHTIVAATQRSGSSGSPPFCRQDTG 
K1STMAT SQLVICTGTGLEPRQAVSSSH 



TRTPS SAEAGGES PQSCVSAAHSDWTAGKPVSLLAPLI PPRSAG 
QPLTFSPSGRQPLRSLLVGMCSGSGRRRSSLSPTMRPGTGAERG 
GLMMGHPGMHYAPMGMHPMGQRANMPPVPHGMMPQMMPPMGGPP 
MGQMPGMMSSVMPGMMMSHMSQASMQPALPPGTOTSMDVAAGTAS 
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SEQ 
ID 
NO: 


Predicted 
beginning 

corre spending 
to first 

amino acid 


Predicted end 
nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid," E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H^IIistidine, I-Isoleucine, K=Lysiae, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginina, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








GAKSMWTEHKSPDGRTYYYNTETKQSTWEKPDDLKTPAEQLLSK 
CPWKEYKSDSGKPYYYNSQTKESRWAKFKELEDLEGYQNTIVAG 
SLITKSNLHAMIKAEESSKQEECTTTSTAPVPTTEIPTTMSTMA 
AAEAAAAWAAAAAAAAAAAAANANASTSASNTVSGTVPVVPEP 
E VTS I VATWDNENTVT I STEEQAQLTSTPAIQDQSVEVS SNTG 
EETSKQETVADFTPKKEEEESQPAKKTYTWNTKEEAKQAFKELL 
KEKRVPSNASMEQAMKMIINDPRYSALAKLSEKKQAFNAYKVQT 


6334 


17 


644 


GGNPSGRAAGFAAAAMPSSPLRVAWCSSNQNRSMEAHNILSKR 
GFSVRSFGTGTHVKLPGPAPDKPNVYDFKTTYDQMYNDLLRKDK 
ELYTQNGILHMLDRNKRIKPRPERFQNCKDLFDLILTCEERVYD 
QWEDLNSREQETCQPVHWNVDIQDNHEEATLGAFLICELCQC 
IQHTEDMENEIDELLQEFEEKSGRTFLHTVCFY 


6335 


82 


529 


AARARPGVLCCRLLGAALGDQSRVEMSYIPGQPVTAWQRVEIH 
KLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRVSEG 
GPAEIAGLQIGDKIMQVNGWDMTMVTHDQARKRLTKRSEEVVRL 
LVTRQSLQKAVQQSMLS 


6336 


1003 


438 


mm i rr iglgpesrihllrn 

LLTGLVRHERIEAPWARVDEMRGYAEKLIDYGKLGDTNERAMRK 
ADFWLTEKDLIPKLFQVLAPRYKDQTGGYTRMLQIPNRSLDRAK 
MAVIEYKGNCLPPLPLPRRDSHLTLLNQLLQGLRQDLRQSQEAS 
NHSSHTAQTEGI 


6337 


76 


524 


EGIQMLSVQPDTKPKGCAGCNRKIKDRYr.LKALDKYWHKDC7r.KC 
ACCDCRLGEVGSTLYTKANLILCRRDYLRLFGVTGNCAACSXLI 
PAFEMVMRAKDNVYHLDCFACQLCNQRFCVGDKFFLKNNMrLCQ 
TDYE EGLMKEGYAP QVR 


S338 


66 


1349 


APNSESGTQGPLPTPANLFWTRRANPDPTTSMSATDRMGPKAVP 
GLRLALLLLLGLGTPKSGVQGQEGLDFPEYEGVDRVINVNAKNY 
KNVFKKYEVLALLYHEPPEDDKASQRQFEMEELILELAAQVLED 
KGVGFGLVDSEKDAAVAKKLGLTEVDSMYVFKGDEVIEYDGEFS 
ADTIVEFLLDVLEDPVELIEGERELQAFENIEDEIKLIGYFKSK 
DSEHYKAFEDAAEEFHPYIPFFATFDSKGAKKLTLKLNEIDFYE 
AFMEEPVTIPDKPNSEEEIVNFVEEHRRSTLRKLKPESMYETWE 
DDMDGIHrVAFAEEADPDGFEFLETLKAVAQDNTENPDLSIIWI 
DPDDFPLLVPYWEKTFDIDLSAPQIGWNVTDADRLWMEMDDEE 
DLPSAEELEDWLEDVLEGEINTEDDDDDDDD 


6339 


246 


1813 


NRCDRC-GGGQAERQAGQGCRTQGAGPGFGFGHSFFSQGAMKAFH 
TFCWLLVFGSVSEAXFDDFEDEEDIVEYDDNDFAEFEDVMEDS 
VTESPQRVriTEDDEDETTVELEGQDENQEGDFEDADTQEGDTE 
SEPYDDEEFE3YEDKPDTSSSKNKDPITIVDVPAHLQNSWESYY 
LEI LMVTGLLAY IMNYI IGKNKN3 RLAQAWFNTHRELLESNFTL 
VGDDGTNKEATSTGKLNQENEHIYNLWCSGRVCCEGMLIQLRFL 
KRQDLLNVLARMMRPVSDQVQIKVTMNDEDMDTYVFAVGTRKAL 
VRLQKEMQDLSEFCSDKPKSGAKYGLPDSLAILSEMGEVTDGMM 
DTKMVHFLTHYADKIESVHFSDQFSGPKIMQEEGQPLKLPDTKR 
TLLLTFNVPGSGNTYPKDMEALLPLMNMVIYSIDKAKKFRLNRE 
GKQKADKNRARVEENFLKLTHVQRQEAAQSRREEKKRAEKERIM 
NEEDPEKQRRLEEAALRREQKKLEKKQMKMKQIKVKAM 


6340 




583 


EACAHTLSCPAFARLGRARRRPWMSHRTSSTFRAERSFHSSSSS 
SSSSTSSSASRALPAQDPPMEKALSMFSDDFGSFMRPHSEPLAF 
PARPGGAGNIKTLGDAYEFAVDVRDFSPEDIIVTTSNNHIEVRA 
EKLAADGTVMNNFAHKCQLPEDVDPTSVTSALREDGSLTIRARR 
HPHTEHVQQTFRTEIKI 


6341 




64 5 


M L_ r _ r R i LGLRSS VGPAVQARGVHQSVATDGPSSTQ 
PALPKARAVAPKPSSRGEYWAKLDDLVNWARRSSLWPMTFGLA 
CCAVEMMHMAAPRYDMDRFGWFRASPRQSDVMIVAGTLTNKMA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

residue of 

sequence 


Predicted end 

location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PALRKVYDQMPEPRYWSMGSCANGGGYYHYSYSWRGCDRIVP 
VDIYIPGCPPTAEALLYGILQLQRKIKRERRLQIWYRR 


6342 


2 


1191 


DPRVRAMLATLARVAALRKTCLFSGRGGGHGLWTGREQSDIVrai 
KPLEGVKILDLTRVLAGPFATMNT.GDLGAEVIKVERPGAGDDTR 
TWGPPFVGTESTYYLS VNRNKKS I AVN I KDP KGVKIIKE LAAVC 
DVFVENYVPGKLSAMGLGYEDIDE IAPHI I YCS ITGYGQTGPIS 
QRAGYDAVASAVSGLMHITGPEVACLGIJIAANYLIGQKEAKRWG 
TAHGSIVPYQAFKTKDGYIWGAGNNQQFATVCKILDLPELIDN 
S KYKTNHLRVHNRKEL I KI LSERFEEELTS KWLYLFEGSGVPYG 
PINNMKNVFAEPQVLHNGLVMEMEHPTVGKISVPGPAVRYSKFK 
MSEARPPPLLGQHTTHILKEVLRYDDRAIGELLSAGWDQHETH 


6343 




9 3 6 


gtamvsdedelnllvivvdanpiwwgkqalkesqftlskcidav 
mvlgnshlfmnrsnklaviashiqesrflypgkngrlgdffgdp 
gnppefnpsgskdgkyelltsaneviveeikdlmtksdikgqht 
etllagslakalcyihrmnkevkdnqemksrilvikaaedsalq 
ymnfmnvifaaqkqnilidacvldsdsgllqqacditgglylkv 
pqmpsllqyllwvflpdodqrsqlilpppvhvdyraacfchr.nl 
ibigyvcsvclsifcnfspicttcetafkislppvlkakkkklk 

VSA 


6344 


2508 


147 


tmptatlgnlrgygmaspglaapsltppqlatpnlqqffpqatr 
qsllgpppvgvpmnpsqfnlsgrnpqkqartsssttpnrkdsss 
qtmpvedksdppegseeaaeprmdtped2dlppcpediakekrt 

PAPEPEPCEASELPAKRLRSSEEPTEKEPPC-QLQVKAQPQARMT 
VPKQTQTPDLLPEALEAQVLPRFQPRVLQVQAQVQSQTQPRIPS 
TDTQVQPKLQKQAQTQTSPEHLVLQQKQVQPQLQQKAEPQKQVQ 
PQVQPQAHSQGPRQVQLQQEAEPLKQVQPQVQPQAHSQPPRQVQ 
LQLQKQVQTQTYPQVHTQAQPSVQPQEHPPAQVSVQPPEQTHEQ 
PHTQPQVSLLAPEQTPWVHVCGLEMPPDAVEAGGGMEKTLPEP 
VGTQVSMEEIQNESACGLDVGECENRAR3MPGVWGAGGSLKVTI 
LQSSDSRAFSTVPLTPVPRPSDSVSSTPAATSTPSKQALQFFCY 
ICKASCSSQQEFQDHMSEPQHQQRLGEIQHMSQACLLSLLPVPR 
DVLETEDEEPPPRRWCNTCQLYYMGDLIQHRRTQDHKIAKQSLR 
PFCTVCNRYFKTPRKFVEHVKSQGHKDKAKELKSLEKEIAGQDE 
DHFITVDAVGCFEGDEEEEEDDEDEEEIEVEEELCKQVRSRDIS 
REEWKGSETYSPNTAYGVDFLVPVMGYICRICHKFYHSNSGAQL 
SHCKSLGHFEWLQKYKAAKNPSPTTRPVSRRCAINARNALTALF 
TSSGRPPSQPNTQDKTPSKVTARPSQPPLPRRSTRLXT 


S345 


2 


3483 


PRVRTKLILLVNDKKRYERVGGGPKRLGRDVEMEEMIEQLQEKV 
HELEKQNDTLKNRLISAKQQLQTQGYRQTPYNNVQSRINTGRRK 
ANENAGLQECPRKGIKFQDADVAETPHPMFTKYGNSLLEEARGE 
IRNLENVIQSQRGQIEELEHLAEILKTQLRRKEKEIELSLLQLR 
EQQATDQRSNIRDNVEMIKLHKQLVEKSIJALSAMEGKFIQLQEK 
1 L 1SHDALMANGDELNMQLKEQRLKCCSLEKQLHSMKFSER 
RIEELQDRINDLEKERELLKENYDKLYDSAF3AAHEEQWKLKEQ 
QLKVQIAQLETALKSDLTDKTEILDRLKTERDQNEKLVQENREL 
QLQYLEQKQQLDELKKRI KLYNQEND INADELSEALLLI KAQKE 
QKNGDLSFLVKVDSEINKDLERSMRELQATHAETVQELEKTRNM 
LIMQHKINKDYQMEVEAVTRKMENLQQDYELKVEQYVHLLDIRA 
ARIHKLEAQLKDIAYGTKQYKFKPEIMPDDSVDEFDETIHLBRG 
ENLFEIHINKVTFSSEVLQASGDKEPVTFCTYAFYDFELQTTPV 
VRGLHPEYNFTSQYLVHVNDLFLQYIQKNTITLEVHQAY3TEYE 
TIAACQLKFHHILEKSGRIFCTASLIGTKGDIPNFGTVEYWFRL 
RVPMDQAIRLYRERAKALGYITSNFKGPEHMQSLSQQAPKTAQL 
SSTDSTD3NLNELHITIRCCNHLQSRASHLQPHPYWYKFFDFA 
DHDTAIIPSSNDPQFDDHMYFPVPMNMDLDRYLKSESLSFYVFD 
DSDTQENIYIGKVNVPLISLAHDRCISGIFELTDHQKHPAGTIH 
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Predicted end I Amino acid segment containing signal peptide 

ldeotide (A=Alanine, C=Cysteine, D=Aspartic Acid, E= 

location Glutamic Acid, F=Phenylalanine , G=Glycine, 

corresponding H=Histidine, I=Isoleucine, K=Lysine, 
first L=Leucine, M=Methionine, N=Asparag±ne , 

no acid P=Proline, Q=Glutamine, R=Arginine, 

idue of S=Serine, T=Threonine, V=Valine, 

no acid W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 

uence Codon, /=possible nucleotide deletion, 

\=possible nucleotide inserti on) 

VILKWKFAYLPPSGSITTEDLGNFIRSEEPEWQRLPPASSVST 
LVLAPRPKPRQRLTPVDKKVSFVDIMPHQSDVSQEGSVDEVKEN 
T3KMQQGKDDVSLLSEGQLAEQSLASSEDETEITEDLEPEVEED 
MSASDSDDCIIPGPISKNIKQPSEKIRIEIIALSLNDSQVTMDD 
TIQRLFVECRFYSLPAEETPVSLPKPKSGQWVYYNYSNVIYVDK 
ENNKAKRDILKAILQKQEMPNRSLRFTWSDPPEDEQDLECEDI 
GVAHVDLADMFQEGRDLIEQNIDVFDARADGEGIGKLRVTVEAL 

HALQSVYKQYRDDLEA 

533 QDRRLLRLELQKTCQPTSTMSGSHTPACGPFSALTPS IWPQE IL 

AKYTQKEESAEQPEFYYDEFGFRVYKEEGDEPGSSLLANSPLME 
DAPQRLRWQAHLEFTHNHDVGDLTWDKIAVSLPRSEKLRSLVLA 
GIPHGMRPQLWMRLSGALQKKRNSELSYREIVKNSSNDETIAAK 
QIEKDLLRTMPSNACFASMGSIGVPRLRRVLRALAWLYPEIGYC 
QGTGMVAACLLLFLEEEDAFWMMSAIIEDLLPASYFSTTLLGVQ 
TDQRVLRHLIVQYLPRLDKLLQEHDIELSLITLHWFLTAFASW 
DIKLL^RIWDLFFYEGSRVLFOLTLGMLIILKEEELIQSENSASI 
FNTLSDIPSQMEDAELLLGVAMRLAGSLTDVAVETQRRKHLAYL 
IADQGQLLGAGTLTMLSQWRRRTQRRKSTITALLFGEDDLEAL 
KAKNIKQTELVADLREAILRVARHFQCTDPKNCSWSRQLPGLL 
PNTALTPPTPLVGLYSLWQELTPDYSMESHQRDHENYVACSRSH 
RRRAKALLDFERHDDDELGFRKND 1 1 T I VSQKDEHCWVGELNGL 
RGWFPAKFVEVLDERSKEYSIAGDDSVTEGVTDLVRGTLCPALK 
ALFEHGLKKPSLLGGACHPWLFIEEAAGREVERDFASVYSRLVL 

cktfrldedgkvltpeellyravqstovthdavhaqmdvklrsl 
icvglkeqvlhlwlevlcsslptvekwyqpwsflrspgwvq;kc 
elrvlccfafslsqdwelpakreaqqplkegvrdmlvkhiilfsw 

DVDG 

533 QDRRLLRLELQKTCQPTSTMSGSHTPACGPFSALTPS IWPQE I L 

AKYTQKEESAEQPEFYYDEFGFRVYKEEGDEPC-SSLLANSPLME 
DAPQRLRWQAHLEFTHNHDVGDLrWDKIAVSLPRSEKLRSLYLA 
GIPHGMRPQLWMRLSGALQKKRNSELSYREIVKNSSNDETIAAK 
QIEKDLLRTMPSNACFASMGSIGVPRLRRVLRALAWLYPEIGYC 
QGTGMVAAGT.LLFLEEEDAFWMMSAIIEDLLPASYFSTTLLGVQ 
TDQRVLRHLIVQYLPRLDKLLQEHDIELSLITLHWFLTAFASW 
DIKLLLRIWDLFFYEGSRVLFQLTLGMLHLKEEELIQSENSASI 
FNTLSDIPSQMEDAELLLGVAMRLAGSLTDVAVETQRRKHLAYL 
IADQGQLLGAGTLTNLSQWRRRTQRRKSTITALLFGEDDLEAL 
KAKNIKQTELVADLREAILRVARHFQCTDPKNCSWSRQLPGLL 
PNTALTPPTPLVGLYSLWQELTPDYSMESHQRDHKNYVACSRSH 
RRRAKALLDFERHDDDELGFRKWDIITIVSQKDEHCWVGELNGL 
RGWFPAKFVEVLDERSKEYSIAGDDSVTEGVTDLVRGTLCPALK 
ALFEHGLKKPSLLGGACHPWLFIEEAAGREVERDFASVYSRLVL 
CKTFRLDEDGKVLTPEELLYRAVQSVNVTHDAVHAQMDVKLRSL 
ICVGLNEQVLHLWLEVLCSSLPTVEKWYQPWSFLRSPGWVQIKC 
ELRVLCGFAFSLSQDWELPAKREAQQPLKEGVRDMLVKHHLFSW 

DVDG 

3 679 AGAEKCFVTLLACFLAKQQNKYKYEECKDLIKSMLRNELQFKEE 
KLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGRDASRSLN 
EHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQKLSPENDN 
DDDEDVQVEVAEKVQKSSSPREMQKAEEKEVPEDSLEECAITCS 
NSHGPCDSNQPHKNIKITFEEDEVNSTLWDRESSHDECQDALN 
ILPVPGPTSSATNVSMWSAGPLSGEKAAINILEINEKLRPQLA 
EKKQQFRNLKEKCFLTQLACFLAWQQNKYKYEECKDLIKFMLRN 
ERQFKEEKLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGR 
DASRSLNEHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQK 
L S P 3 MDNDDDEDVQVE VAE KVQKS S APREM P KAE EKEVPEDSLE 
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SEQ 
ID 
NO: 


Predicted 
beginning 

nucleotide 

corresponding 
to first 
amino acid 

sequence 


Predicted end 

nucleotide 

location 

to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L= Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamxne, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ECAITCSNSHGPYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEW 

EDAVHIIPENESDDEEEEEKGPVSPRNLQESEEEEVPQESWDEG 

YSTLSIPPEMLASYKSYSSTFHSLEEQQVCMAVDIC-RHRWDQVK 

KEDHEATGPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDS 

CQPYRSAFYVLEQQRVGLAVNMDEIEKYQEVEEDQDPSCPRLSR 

ELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQPYSSAVYSLEE 

QYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLiEWEPEVLQD 

SLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHV3FSLDVGEIE 

KKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKG 

PEVLQDSLDRCYSTPSGCLELTDS CQPYRSAFYI LEQQRVGLAV 

DMDEIEKYQEVEEDQDPSCPRLSGEliLDEKEPEVLQESLDRCYS 

TPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMDEIEKYQEVEE 

DQDPSCPRLSRELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQ 

PYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSREL ' 

LEWEPEVLQDSLDRCYSTPSSCLEQPDSCQPYG3SFYALEEKH 

VGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCP 

RLNSMLMEVEEPEVLQDSLDICYSTPSMYFELPDSFQHYRSVFY 

SFEEEHI SFALYVDNRFFTLTVTSLHLVFQMGVI FPQ 


S349 


3 


3679 


AGA2KCFVTLLACFLAKQQNKYKYEECKDLIKSMjRNELQFKEE 
KLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGRDASRSLN 
EIILQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQKLSPENDN 
( DDE J ,T JVFVAEKVQKSSSPREMQKAEEKEVPEDSL,EECAITCS 
NSHGPCDSNQPHKNIKITFEEDEVNSTLVVDRESSHDECQDALN 
ILPVPGPTSSATNVSMWSAGPLSGEKAAINILEINEKLRPQLA 
EKKQQFRWLKEKCFLTQLACFLANQQNXYKYEECKDLIKFMLRNT 
ERQFKEEKLAEQLKQAEELRQYKVLTOSQERELTQLREKLREGR 
DASRSLNEHLQALLTPDEPDKSQGQDLQE2LAEGCRLAQHLVQK 
LSPENDNDDDEDVQVEVAEKVQKSSAPREMPKAEEKEVPEDSLE 
ECAITCSNSHGPYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEW 
EDAVHIIPENESDDEEEEEKGPVSPRNLQESEEEEVPQESWDEG 
YS TLS I P PEMLAS YKSYSSTFHSLEE QQVCMAVD I GRHRWDQ VK 
KEDHEATGPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDS 
CQPYRSAFYVLEQQRVGLAVKMDEIEKYQEVEEDQDPSCPRLSR 
ELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQPYSSAVYSLEE 
QYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEWEPEVLQD 
SLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIE 
KKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKG 
PEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGIAV 
DMDEIEKYQEVEEDQDPSCPRLSGELLDEKEPEVLQESLDRCYS 
TPSGCLELTDSCUPYRSAFYILEQQRVGLAVDMDEIEKYQEVEE 
DQDPSCPRLSRELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQ 
PYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSREL 
LEWEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKH 
VGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDONPPCP 
i J- LMEVEEPEVLQDSLDICYSTPSMYFELPDSFQHYRSVFY 
SFE3EHIS FAL YVDNRF FTLT VTS LHLVFQMGVI FPQ 


6350 


3 


3679 


-1 B "TLLACFLAKQQNKYKYEECKDLIKSMLRNELQFKEE 
KLAEQLKQAEELRQYKVL.VHSQERELTQLREKLREGRDASRSLN 
EHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQKLSPENDN 
DDD3DVQVEVAEKVQKSSSPREMQKAEEKEVPEDSLEECAITCS 
NSH3PCDSNQPHKNIKITFEEDEVNSTLWDRESSHDECQDALN 
ILPVPGPTSSATNVSMWSAGPLSGEKAAINILEINEKLRPQLA 
3i . 'FRNLKEKCFLTQLACFLANQQNKYKYEECKDLIKFMLRN 
ERQFKEEKLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGR 
DASRSLNEHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQK 
LSP3NDNDDDEDVQVEVAEKVQKSSAPREMPKAEEKEVPEDSLE | 
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SEQ 
ID 
NO: 


beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 

amino acid 

amino acid 
sequence 


Ammo acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methlonine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ECAITCSNSHGPYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEW 
EDAVHIIPENESDDEEEEEKGPVSPRNLQESEEEEVPQESWDEG 
YSTLSIPPEMLASYKSYSSTFHSLEEQQVCMAVDIGRHRWDQVK 
KEDHEATGPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDS 
CQPYRSAFYVLEQQRVGLAVNMDEIEKYQEVEEDQDPSCPRLSR 
ELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQPYSSAVYSLEE 
QYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEWEPEVLQD 
SLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIE 
KKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLEEKG 
PEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGLAV 
DMDEIEKYQEVEEDQDPSCPRLSGELLDEKEPEVLQESLDRCYS 

DQDPSCPRLSRELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQ 
PYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSREL 
LE WEPE VLQDSLDRC YSTPS S CLEQFDS CQPYGS S FYALEE KH 
VGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCP 
RLNSMLMEVEEPEVLQDSLDICYSTPSMYFELPDSFQHYRSVFY 


6351 


1291 


319 


R3ARRRTERSQLGRMLVV3VANGRSLVWGAEAVQALRERLGVGG 
RTVGALPRGPRQNSRLGLPLLLMPEEARLLAEIGAVTLVSAPRP 
DSRHHSLALTSFKRQQEESFQEQSAIAAEARETRRQELLEKITE 
GQAAKKQKLEQASGASSSQEAGSSQAAKEDETSDGCASGEQEEA 
GPSSSQAGPSNGVAPLPRSALLVQLATARPRPVKA^PLDIiRVQt 
KDWPHAGRPAHELRYSIYRDLWERG?FLSAAGKFGGDFLVYPGD 
PLRFHAHY I AQCWAPEDT I PLQDLVAAGRLGTSVRKTLLLCS PQ 
PDGKWYTSLQWASLQ 


6352 






AMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHVGQGAPG 
LMGNMKPEGGWHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEE 
PAQAAMEGPQPENMQPRTRRTKFTLLQVEELESVFRHTQYPDVP 
TRRELAENLGVTEDKVRVWFKNKRARCRRHQRELMLANELRADP 
DDCVYIWD 


6353 


65 


572 


RFAGAGAIPEARARPPDVQAAEEEKEMDLPDSASRVFCGRILSM 
VNTDDVNAIILAQKNMLDRFEKTNEMLLNFNNLSSARLQQMSER 
FLHHTRTLVEMKRDLDGIFRRIRTLKGKLARQHPEAFSHIPEAS 
FLEEEDEDPIPPSTTTTIATSEQSTGSCDTSPDTVSPSLSPGFE 
DLSHVQPGSPAINGRSQTDDEEMTGE 


S354 


965 


510 


PSLRPMEPTRDCPLFGGAFSAILPMGAIDVSDLRPVPDNQEVFC 
HPVTDQSLIVELLELQAHVRGEAAARYHFEDVGGVQGARAVHVE 
SVQPLSLENLALRGRCQEAWVLSGKQQIAKENQQVAKDVTLHQA 
LLRLPQYQTDLLLTFNQPP 


6355 




1662 


RGSSAAFRGSGLRGAMIRRVLPHGMGRGLLTRRPGTRRGGFSLD 
WDGKVSE I KKKI KS I LPGRS CDLLQDTSHLPPEHSDWIVGGGV 
LGLSVAYWLKKLESRRGAIRVLWERDHTYSQASTGLSVGGICQ 
QFSLPENIQLSLFSASFLRNINEYLAWDAPPLDLRFNPSGYLL 
LASEKDAAAMESNVKVQRQEGAKVSLMSPDQLRNKFPWINTEGV 
ALASYGMEDEGWFDPWCLLQGLRRKVQSLGVLFCQGEVTRFVSS 
SQRMLTTDDKAWLKRIHEVHVKMDRSLEYQPVECAIVINAAGA 
WSAQ IAALAGVGEGP PGTLQGTKLPVE PRXRYVYVWHCPQGPGL 
ETPLVADTSGAYFRREGLGSWYLGGRSPTEQEEPDPANLEVDHD 
FFQDKVWPHLALRVPAFETLKVQSAKAGYYDYNTFDQNGWGPH 
PLWNMYFATGFSGHGLQQAPGIGRAVAEMVLKGRFQTIDLSPF 
LFTRFYLGEKI QENNI I 


S356 


354 


633 


TGLTSSCLPIiQVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREV 
ADSYAQNAKVIEKQLERKGMSKRRLQELAELEAKKAKMXGTLID 
NQFK 
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SEQ 
NO: 


Predicted 
beginning 
nucleotide - 

corresponding 
to first 
amino acid 
residue of 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Hiatidine, I^Isoleucine, K=Lysine, 
L=Leucine ( M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6357 


2 


915 


GLLRNMALLVRVLRNQTS ISQWVP VCSRL I PVS PTQGQGDRALS 
RTSQWPQMSQSQACGGSEQIPGIDIQLNRKYHTTRKLSTTKDSP 
QPVEEKVGAFTKIIEAMGFTGPLKYSKWKIKIAALRMYTSCVEK 
TDFEEFFLRCQMPDTFNSWFLITLLHVWMCLVRMKQEGRSGKYM 
CR 1 I VHFMWEDVQQRGRVMGWPYI LKKNMI LMTNHF YAAILGY 
DEGILSDDHGI.AAALWRTFFNRKCEDPRHLELLVEYVRKQIQYL 
DSMNGEDLLLTGEVSWRPLVEKNPQSILKPIISPTYNDEGL 


6358 


2009 


1040 


ASDALHSLSAPVLRLSSRSAARPATMTEQAISFAKDFLAGGIAA 
AI S KTAVAP 1 ERVKLLLQVQHAS KQ I AADKQYKG I VDCI VRI PK 
EQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQIFLGGVDKHT 
QFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKSGTE 
REFRGLGDCLVKITKSDGIRGLYQGFSVSVQGIIIYRAAYF3VY 
DTAKGMLPDPKNTHIWSWMIAQTVTAVAGWSYPFDTVRRRMM 
MQSGRKGADIMYTGTVDCWRKIFRDEGGKAFFKGAWSNVLR3MG 
GAFVLVLYDELKKVI 


G359 


98 


1086 


VCRQEEEKMKEDCLPSSHVPISDSKSIQKSELLGLLKTYNCYHE 
GKSF0LRHRE33GTLIIEGLLNIAWGLRRPIRl,QMQDDKKQV--iT. 
PSTSWMPRRPSCPLKEPSPQNGNITAQGPSIQPVHKAESSTDSS 
GPLEEAEEAPQLMRTKSDASCMSQRRPKCRAPGEAQRIRRHRFS 
IKGHFYNHKTSVFTPAYGSVim'RWSTMTTLQVLTLLLKKFRV 
EEGPSEFALYIVHRSGERTKr.KDCEYPLISRILHGPCEKIARIF 
LMEADLGVEVPHEVAQYIKFEMPVLDSFVEKLKEEEEREIIKLT 
MKFQALRLTMLQRLEQLVEAK 


S360 


1 


345 


GTRGAV? S TLEE WLPPRS CRVFW IIISHTTMS KVS FKI TLTSDP 
RLPYKVLSVPESTPFTAVLKFAAEEFKVPAATSAI ITNDG IGIN 
PAQTAGNVFLKHGSELRT I PRDRVGSC 


6361 


615 


158 


RPGLGQLQHCALAPQAGNRRCRFHGRL,HALTRSTHRGKPMS I MQ 
FKDTLNTPLPDSSPVAVPLGAPIAVASTLSVEHNDGVETGIWAC 
APGRWRRQITSQEFCIIFIQGRCTFTFDD3ETLHIQAGDALMLPA 
NSTG I WD I QET VRKTYVL I L 


63 6 2 


350 


1575 


TTMDGSHSAALKLQQLPPTSSSSAVSEASFSYKENLIGALLAIF 
GHLWSIALNLQKYCHIRLAGSKDPRAYFKTKTWWLGLFLMLLG 
ELGVFASYAFAPLSLIVPLSAVSVIASAIIGIIFIKEKWKPKDF 
LRRYVLSFVGCGLAWGTYLLVTFAPNSHEKMTGENVTRHLVSW 
PFLLYMLVEIILFCLLLYFYKEKWANNIWILLLVALLGSMTW 
TVKAVAGMLVLS I QGNLQLDYP IFYVMFVCMVATAVYQAAFLSQ 
ASQMYDSSLIASVGYILSTTIAITAGAIFYLDFIGEDVLHICMF 
ALGCLIAFLGVFLITRNRKKPIPFEPYISMDAMPGMQNKHDKGM 
TVQPELKASFSYGALENNDNISEIYAPATLPVMQEEHGSRSASG 
VPYP.VLSHTKKE 


6363 


21 


1201 


RRTRLGSSFPRRRDSSAMESYDVIANQPWIDNGSGVIKAGFAG 
DQIEKYCFPNYVGRPKHVRVMAGALEGDIFIGPKAE3HRGLLSI 
RYPMEHGIVKDWNDMERIWQYVYSKDQLQTFSEEHPVLLTEAPL 
NPRKKRERAAEVFFETFNVPALFISMQAVLSLYATGRTTGWLD 
SGDGVTHAVPIYEGFAMPHSIMRIDIAGRDVSRFLRLYLRKEGY 
DFHSSSEFEIVKAIKERACYliSINPQKDETLETEKAQYYLPDGS 
TIEIGPSRFRAPELLFRPDLIGEESEGIHEVLVFAIQKSDMDLR 
RTLFSNIVLSGGSTLFKGFGDRLLSEVKKLAPKDVKIRISAPQE 
RLYSTWIGGS1LASLDTFKKMWVSKKEYEEDGARSIHRKTF 


6364 


21 


1201 


RRTB -GSSFPRRRDSSAMESYDVIANQPWIDNGSGVIKAGFAG 
DQIPKYCFPNYVGRPKHVRVMAGALEGDIFIGPKAEEHRGLLSI 
RYPMEHGIVKDWNDMERIWQYVYSKDQLQTFSEEHPVLLTEAPL 
NPRKNRERAAEVFFETFNVPALFISMQAVLSLYATGRTTGWLD 
SGDGVTHAVPIYEGFAMPHSIMRIDIAGRDVSRFLRLYLRKEGY 
DFHSSSEFEIVKAIKERACYLSINPQKDETLETEKAQYYLPDGS 
TIEIGPSRFRAPELLFRPDLIGEESEGIHEVLVFAIQKSDMDLR 
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corresponding 



Ammo acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
RTLFSNIVLSGGSTLFKGFGDRLLSEVKKIAPKDVKIRISAPQE 
RLYSTWIGGSILASLDTFKKMWVSKKEYEEDGARSI HRKTF 
KHKSRASU\ARAQAFGPSREREVHSRFRSGLRRLGESNSGCCTM ~ 
ASMGTLAFDEYGRPFLIIKD3DRKSRLMGLEALKSHIMAAKAVA 
NTMRTSLGPNGLDKMMVDKDGDVTVTNDGATILSMMDVDHQIAK 
LMVELSKSQDDEIGDGTTGVWLAGALbEEAEQLLDRGIHPIRI 
ADGYEQAARVAIEHLDKISDSVLVDIKDTEPLIQTAKTTLGSKV 
WSCHRQMAEIAVNAVLTVADMERRDVDFELIKVEGKVGGRLED 
TKLIKGVIVDKDFSHPQMPKKVEDAKIAILTCPFEPPKPKTXHK 
LDVTSVEDYKALQKYEKEKFEEMIQQIKETGAHLAICQWGFDDE 
ANHLLLQMNLPAVRWVGGPEIELIAIATGGRIVPRFSELTAEKL 
GFAGLVQEISFGTTKDKMLVIEQCKNSRAVTIFIRGGNKMIIEE 
AKRSLHDALCVIRNLIRDNRWYGGGAAEISCALAVSQEADKCP 
TLEQYAMRAFADALEVIPMALSENSGMNPIQTMTEVRARQVKEM 
NPALGIDCLKKGTNDMKQQHVIETLIGKKQQISLATQMVRMILK 
IDDIRKPGESEE 
GNKEGAH3ST?WVT,I,a:i >'l .GAVAMT.nKEQGffVLGLNAVFDILV 
IGKFHVLEIVQKVLHKDKSLENLGMLRNGGLLFRMTLLTSaGAG 
MLYVRWRIMGTGPPAFTEVDNPASFADSMLVRAVWYNYYYSLNA 
WLLLCPWWLCFDWSMGCIPLIKSISDWRVIALAALWFCLIGLIC 
QALCSEDGHKRRILiTLGLGFLVIPFLPASNLFFRVGFWAERVL 
YLPS VGYCVLLT FGFGALS 10 ITKKKKL I AAWLG I LF I NTLRCV 

lrsgewrseeqlfrsalsvcplnakvhynigknladkgnqtaai 
ryyreavrlnpkyvhamnnlgnilkern3lqeaeellslavqiq 
pdfaaawmnlgivqnslkrfeaaeqsyrtaikhrrkypdcyynl 
grlyadlnrhvdat.nawrnatvlkpehslawnnmiIjLDktgnl 

AQAEAVGREALELIPNDHSLMFSLANVLGKSQKYKESEALFLKA 
IKANPNAASYHGNLAVLYHRWGHLDLAKKHYEISLQLDPTASGT 
KENYGLLRRKLELMQKKAV 



SIGFPVMLVLSILLYTCEMFQDSVAFEDVAVSFTQEEWALLDPS 
QKNLYRDVMQETFKNLTSVGKTWKVQNIEDEYKNPRRNLSLMRE 
KLCESKESHHCGESFNQIADDMLNRKTLPGITPCESSVCGEVGT 
GHSSLNTHIRADTGHKSSEYQEYGENPYRNKECKKAFSYLDSFQ 
SHDKACTKEKPYDGK3CTETFISHSCIQRHRVMHSGDGPYKCKF 
CGKAFYFLNLCLIHERIHTGVKPYKCKQCGKAFTRSTTLPVHER 
THTGVNADECKECGNAFSFPSEIRRHKRSHTGEKPYECKQCGKV 
FI S FS S IQYHKMTHTGEKPYECKQCGKAFRCGSHLQKHGRTHTG 
EKPYECRQCGKAFRCTSDLQRHEKTHTEDKPYGCKQCGKGFRCA 
SQLQIHERTHSGEKPHECKECGKVFKYFSSLRIHERTHTGEKPH 
ECKQCGKAFRYFSSLHIHERTHTGDKPYECKVCGKAFTCSSSIR 
YHERTHTGEKPYECKHCGKAFISNYIRYHERTHTGEKPYQCKQC 
GKAFIRASSCREHERTHTINR 



RPVPAKLNPRSWPRTAGALPLRPPPLTMAVFHDEVEIEDFQYDE 
DSETYFYPCPCGDNFSITKEDLENGEDVATCPSCSLIIKVIYDK 
DQFVCGETVPAPSANKELVKC 



AGCCRDTRFPTPRGPGSLCHNFCRSAACTVTRTIHGSPREDTGT 
PRSREMMFQDSVAFEDVAVSFTQEEWALLDPSQKNLYRDVMQET 
FKNLTSVGKTWKVQNIEDEYKNPRRNLSLMREKLCESKESHHCG 
ESFNQIADDMLNRKTLPGITPCESSVCGEVGTGHSSLNTHIRAD 
TGHKSSEYQEYGENPYRNKECKKAFSYLDSFQSHDKACTKEKPY 
DGKECTETFISHSCIQRHRVMHSGDGPYKCKFCGKAFYFLNLCL 
IHERIHTGVKPYKCKQCGKAFTRSTTLPVHERTHTGVNADECKE 
CGWAFSFPSEIRRHKRSHTGEKPYECKQCGKVFISFSSIQYHKM 
THTGEKPYECKQCGKAFRCGSHLQKH3RTHTGEKPYECRQCGKA 
FRCTSDLQRHEKTHTEDKPYGCKQCGKGFRCASQLQ1HERTHSG 
EKPHECKECGKVFKYFSSLRIHERTHTGEKPHECKQCGKAFRYF | 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

residue of 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine ; 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








S S1HIHERTHTGDKP YE CKVCGKAFTCS S S IRYHERTHTGEKP Y 
ECKHCGKAFISNYIP.YHERTHTGEKPYQCKQCGKAFIRASSCRE 
HERTHTINR 


6370 


1711 


329 


FVLS EQRLRTERTWPRS PGLGRGAAAAGARTAGAGLLRLLLGCG 
ALVGGLRPVTMTTPANAQNASKTWELSLYELHRTPQEAIMDGTE 
IAVSPRSLHSELMCPICLDMLKNTMTTKECLHRFCSDCIVTALR 
SGNKSCPTCRKKLVSKRSLRPDPNFDALISKIYPSKEEYEAHQD 
RVLIRLSRLHNQQALSSSIEEGLRMQAMHRAQRVRRPIPGSDQT 
TTMSGGEGEPGEGEGDGEDVSSDSAPDSAPGPAPKRPRGGGAGG 
SSVGTGGGGTGGVGGGAGSEDSGDRGGTLGGGTLGFPSPPGAPS 
PPEPGGEIELVFRPHPLLVEKGEYCQTRYVKTTGNATVDHLSKY 
LALRIALERRQQQEAGEPGGPGGGASDTGGPDGCGGEGGGAGGG 
DGPEEPALPSLEGVSEKQYTIYIAPGGGAFTTLNGSLTLELVNE 
KFWKVSRPLELCYAPTKDPK 


G371 


3 


288 


GVANMSTAMTJFGTKSFQPRPPDKGSFPLDHLGECRSFKEKFMKC 
LHNNNFENALCRKESKEYLECRMERKLMLQEPLEKLGFGDLTSG 
KSEAKK 


5372 


2141 


625 


EVSAIASEGKAEERYKKLEDLLEKSFSLVKMPSLQPVVMCVMKH 
L P KVP EKKLKLVMAD KELYRACAVE VRRQ I W QDNQALFGDE VS P 
LLKQYILEKESALFSTELSVLHKFFSPSPKTRRQGEWQRLTRM 
VGKNVKLYDMVLQFLRTLFLRTRNVHYCTLRAELLMSLHDLDVG 
EICTVDPCHKFTWCLDACIRERFVDSKRARELQGFUDGVKKGQE 
QVLGDLSMILCDPFAIMTIJVLSTVRHLQELVGQETLPRDSPDLL 
LLLRLLALGQGAWDMIDSQVFKEPKMEVELITRFLPMLMSFLVD 
DYTFNVDQKLPAEEKAPVSYPNTLPESFTKFLQEQRMACEVGLY 
YVLH I TKQRNKNALLRLLPGL VE T FGDLAFGD I FLHLLTGNLAL 
LADEFALEDFCSSLFDGFFLTASPRKENVHRHALRLLIHLHPRV 
APSKLEALQKALEPTGQSGEAVKELYSQLGEKLEQLDHRKPSPA 
QAAETPALELPLPSVPAPAPL 


6373 


67 


711 


PSRAARASPARLPAMVSWIISRLWLIFGTLYPAYYSYKAVKSK 
DIKEYVKWMMYWIIFALFTTAETFTDIFLCWFPFYYELKIAFVA 
WLLSPYTKGSSLLYRKFVHPTLSSKEKEIDDCLVQAKDRSYDAL 
VHFGKRGLNVAATAAVMAASXGCGALSERLRSFSMQDLTTIRGD 
GAPAPSGPPPPGSGRASGKHGQPKMSRSASESASSSGTA 


6374 


535 


2105 


HKLFCSYISTSEFPSSTRHHSCPTHTFCNYTSSTIFLSSTRDHS 
CPTHTFCNYTSSTIFLSSTRDHSCPTHTSCNYTSSTIFLSSTRD 
HSCPTHTSCNYTSSTIFLSSTRDHSCPTHTFCNYPRPIIRLSSC 
CPAELQTEGSNGKKEVLSGFQWLEDTVLFPEGGGQPDDRGTIN 
DISVLRVTRRGEQADHFTQTPLDPGSQVLVRVDWERRFDHMQQH 
SGQHLITAVADHLFKLKTTSWEL3RFRSAIELDTPSMTAEQVAA 
IEQSTOEKIRDRLPVNVRELSLDDPEVEQVSGRGLPDDHAGPIR 
VWIEGVDSNMCCGTHVSNLSDLQVIKILGTEKGKKNRTNLIFL 
SGNRVLKWMERSHGTEKALTALLKCGAEDHVEAVKKLQNSTKIL 
QKNNLWLLRDLAVHIAHSLRWSPDWGGWILHRKEGDSEFMNII 
ANEIGSEETLLFLTVGDEKGGGLFLLAGPPASVETLGPRVAEVL 
E 31 -.-.GKKGRFQGKATKMSRRMEAQALLQDYISTQSAKE 


6375 




1535 


AIMAAATRPVRLPEAGCEGRERCWNESRSRSHSGEGGLAAWSRT 
CPGRPRR PGQQ WRG PTMLVTAYLAFVGLLAS CLGLELS RCRAK 
PPGRACSNPSFLRFQLDFYQVYFLALAADWIiQAPYLYKIiYQHYY 
FIY5GQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVLFSLTY 
SLCCLTKLSQDYFVLLVGRALGGLSTALLFSAFEAWYIHEHVER 
HDF7AEWIPATFARAAFWNHVLAWAGVAAEAVASWIGLGPVAP 
FVAAIPLLALAGALALRNWGENYDRQRAFSRTCAGGLRCLLSDR 
RVLLLGTIQALFESVIFIFVFIiWTPVLDPHGAPLGIIFSSFKAA 
SLLGSSLYRIATSKRYHLQPMHLLSLAVLIWFSLFMLTFSTSP 
GQESFVESFIAFLLIEIiACGLYFPSMSFLRRKVIPETEQAGVLM 
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SEQ 
WO: 


Predicted 
beginning 

corresponding 
to first 
amino acid 
residue of 

sequence 


corresponding 
segSence ld 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Iiysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








W?RVPLHSLACLGLLVLHDSDRKTGTRNMFSICSAVMVMALLAV 
VGLFTWRHDAELRVPSPTEEPYAPEL 


6376 


380 


143 7 


ISSTDIDHYRFSFLWSKMPSKESWSGRKTNRAAVHK8KQEGRQ 
QDLLIAALGMKLGSPKSSVTIWQPLKLFAYSQLTSLV^RATLKE 
NEQIPKYEKIHNFKVHTFRGPHWCEYCANFMWGLIAQGVKCADC 
GLNVHKQCSKMVPNDCKEDLKHVKKVYSCDLTTLVKAHTTKRPM 
WDMCIREIESRGLNSEGLYRVSGFSDLIEDVKMAFDRDGEKAD 
ISVNMYEDINIITGALKLYFRDLPIPLITYDAYPKFIESAKIMD 
PDEQLETLHEALKLLPPAHCETLRYLMAHLKRVTLHEKENLMNA 
ENLGIVFGPTLMRSPELDAMAALNDIRYQRLWELLIKNEDILF 


6377 


2311 


M s 


SRIRRRSSRRPREPPGPSRRRRRRRPDPRTMPSEKTFKQRRTFE 
QRVEDVRLIREQHPTKIPVIIERYKGEKQLPVLEKTKFLVPDHV 
NMSELIKIIRRRLQLNANQAFFLLVNGHSMVSVSTPISEVYESE 
KDEDGFL YMVYAS QETFGMKLS V 


6378 


686 




f- r i " r h 
DLALIPDVDIDSDGVFKYVLIRVHSAPRSGAPAAESKEIVRGYK 
WAEYHADIYDKVSGDMQKQGCDCECLGGGRISHQSQDKKIHVYG 
YSMAYGPAQHAISTEKIKAKYPDYEVTWANDGY 


S379 


35 


378 


E . -tPSRAALRRCAPQRSQAPRWPDRAACRRSFQGSQGRAY 
LFNSWK\ r GCGPAEERVLLTGLHAVAniYCENCKTTLGWKYEHA 
FES S QKYKEGKYI I ELAHMIKDNGWD 


S380 


1414 




PAVQGCBGAGPPTGRGSGNMARFALTVVRHGETRFKKEKIIQGQ 
GVDEPLSETGFKQAAAAGIFLNNVKFTHAFSSDLMRTKQTMHGI 
LERSKFCKDMTVKYDSRLRERKYGWEGKALSELRAMAKAAREE 
CPVFTPPGGETLDQVKMRGIDFFEFLCQLILKEADQKEQFSQGS 

MRSLFDYFLTDLKCSLPATLSRSELMSVTPNTGMSLFIINFEEG 
REVKPTVQCICMNLQDHLNGLTEKSLGLNLPSKSNHFEPLKGVP 
LALFTSLLC 


6381 


1663 


218 


AWRAQGSRGFSGAGWRPRQAAAMNFSEVFKLSSLLCKFSPDGK 
YLASCVQYRLWRDVNTLQILQLYTCLDQIQHIEWSADSLFILC 
AMYKRGLVQVWSLEQPEWHCKIDEGSAGLVASCWSPDGRHXLNT 
TEFHLRITVWSLCTKSVSYIKYPKACLQGITFTRDGRYMALAER 
RDCKDYVGIFVCSDWQLLRHFDTDTQDLTGIEWAPNGCVLAVWD 
TCLEYKILLYSLDGRLLSTYSAYEWSLGIKSVAWSPSSQFIiAVG 
SYDGKVRILNHVTWKMITEFGHPAAINJPKIWYKEAEKSPQL3 
LGCLSFPPPRAGAGFLPSSESKYEIASVPVSLQTLKPVTDRANP 
K1G1GMLAFSPDSYFLATRNDN1PNAVWVWDIQKLRLFAVLEQL 
SPVRAFQWDPQQPRLAICTGGSRLYLWSPAGCMSVQVPGEGDFA 
VLSLCWHLSGDSMALLSKDHFCLCFLETEAWGTACSQLGGHT 


6382 


2 


1062 


FE-ii'iJDRNLCLIAYPLKGDHGIVDIVDNSDCEPKSKLLRWTTNK " 

KHHVLETEKTPKDWVROHRKEEKMKSHKLEEEFEWLKKSEVLYY 

TVEKKGNISSQLKHYNPWSMKCHQQQLQRMKENAKHRMQYKFIL 

LENLTSRYEVPCVLDLKMGTRQHGDDASEEKAANQIRKCQQSTS 

AVIGVRVCGMQVYQAGSGQLMFMNKYHGRXLSVQGFKEALFQFF 

HXGRYLRRELLGPVLKKLTELKAVLEEQESYRFYSSSLLVIYDG 

KERPEWLDSDAEDLEDLSEESADESAGAYAYKP1GASSVDVRM 

IDFAHTTCRLYGEDTWHEGQDAGYIFGLQSLID1VTEISEESG 

E 


6383 


3159 


1061 


S PAPGRPS PHGSQPAARAAAAPAMPSAKQRGSKGGHGAASPSEK 
GAHPSAARPLAAPTPAAPACRSPSPGGAPASFPGRAPRSLASQP 
AARAAAAPAMPSAKQRGSKGGHGAAS PS EKGAHPS GGADDVAKK 
PPPAPQQPPPPPAPHPQQHPQQHPQNQAHGKGGHRGGGGGGGKS 
SSSSSASAAAAAAAASSSASCSRRLGRALNFLFYLALVAAAAFS 
GWCVHE\ r LEEVQQVRRSHQDFSRQREELGQGLQGVEQKVQSLQA 
TFGTFESILRSSQHKQDLTEKAVKQGESEVSRISEVLQKLQNEI 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corre sponding 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 

amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 

P=Proline, Q=Glutamine, H=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LKDLSDGIHWKDARERDFTSLENTVEERLTELTKSINDNIAIP 
TEVQKRSQKEINDMKAKVASLEESEGNKQDLKALKEAVKEIQTS 
AKSHEWDMEALRSTLQTMESDIYTEVRELVSLKQEQQAFKEAAD 

KEDGGFRHSEAFEALQQKSQGLDSRLQHVEDGVLSMQVASARQT 
ESLESLLSKSQEHEQRLAALQGRLEGLGSSEADQDGLASTVRSL 
GETQLVLYGDVEELKRSVGELPSTVESLQKVQEQVHTLLSQDQA 
QAARLPPQDFLDRLSSLDNLKASVSQVEADLKMLRTAVDSLVAY 
SVKIETNENNLESAKGLLDDLRNDLDRLFVKVEKIHEKV 


5384 


738 




IWEVPVCLTHLLHLQQANQPLPPPSSSINEEDADEANRAIGEKR 
AAPDSGKKPKTPKTKQQKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVSQIVASMWDSLGEEQKQVYKRKTEAAKKEYLKALA 
AYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNTPLSQ 
HGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMP 
SNIGAPL1SSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQ 
QQQMQCMQUCQLCQHUMII^CIQQQMQQC'H^QHHMQQHIjQCQQQH 
LQi i • ! NQQ QLQQQLQQRLQLQQLQHMQHQSQPS PRQHS PVASQ I 
TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF 


6385 


2 


1584 


PRVRAADVAAGAQAVVSAGMAKSUGENGPRAPAAGESLSGTRES 
LAQGPDAATTDELSSLGSDSEANGFAERRIDK7GFIVGSQGAEG 

ajjEEVpleylrqreskwldmt xnwdawmakkhkkirlrcqkc^ip 
pslrgrawqyi.sggkvklqqnpgkkdeldmspgdpkwldvierd 
lhrqfpfhemfvsrgghgqqdlfrvjkaytlyrpeegycqaqap 

IAAVLLMHMPAEQAFWCLVQICEKYLPGYYSEKLEAIQLDGEIL 
FSLLQKVS PVAHKHLSRQXIDPLLYMTEWFMCAFSRTLPWSSVL 

LRSLSPKIMQEAFLVQEWELPVTERQIEREHLIQLRRWQETRG 
ELQCRSPPRLHGAKAILDAEPGPRPALQPSPSIRLPLDAPLPGS 
KAKPKPPKQAQKEQRKQMKGRGQLEKPPAPNQAMVVAAAGDACP 
PQHVPPKDSAPKDSAPQDLAPOVSAHHRSQESLTSQESEDTYL 


6386 




195 


RAQILGGANTPYEKGVFKLEVIIPERYPFEPPQIRFLTPIYHPN 
IDSAGRICLDVLKLPPKGAWRPSLNIATVLTSIQLLMSEPNPDD 
PLMADISSEFKYNKPAFLKNARQWTEKHARQKQKADEEEMLDNL 
PEAGDSRVHNSTQKRKASQLVGIEKKFHPDV 


6387 






KQELAETLANLERQIYAFEGSYLEDTQMYGNI1RGWDRYLTNQK 
WSNSKWDRRWRKFKEAERLFSKSSVTSAAWSALAGVQDQLIEK 
REPGSGTESDTSPDFHNQENEPSQEDPEDLDGSVQGVKPQKAAS 
STSSGSHHSSHKKRKNKNRHSPSGMFDYDFEIDLKLNKKPRADY 


6388 


1 


662 


PGPTHASADAWADAWAQPNMAMHNKAAPPQIPDTRR3LAELVKR 
KQELAETLANLERQIYAFEGSYLEDTQMYGNIIRGWDRYLTtTQK 
NSNSKNDRRNRKFKEAERLFSKSSVTSAAAVSALAGVQDQLIEK 
REPGSGTESDTSPDFHNQENEPSQEDPEDLDGSVQGVKPQKAAS 
STSSGSHHSSHKKRKNKNRHSPSGMFDYD?EIDLKLNKKPRADY 


6389 


1074 


497 


AEPGDRMAGHRLVLVLGDLHIPHRCNSLPAKFKKLLVPGKIQH1 
LCTGNLCTKESYDYLKTLAGDVHIVRGDFDENLNYPEQKWTVG 
QFKIGLIHGHQVIPWGDMASLALLQRQFDVDILISGHTHKFEAF 
EHENKFYINPGSATGAYWALETNIIPSFVLMDIQASTWTYVYQ 
LIGDDVKVERIEYKKP 


6390 


158 


535 


GEERKEGRAPGKAFAPERKPAKMEKEETTRELLLPNWQGSGSHG 
LTIAQRDDGVFVQEVTQNSPAARTGWKEGDQIVGATIYFDNLQ 
S G E VTQ LLNTMGHHT VGLKLHRKGDRFF P S LGQTWDP 


6391 


5386 


2897 


\ WNS KTE C YLS IQTQENFPANLNELVNCI VI SSLVTTQRKLKA 
MSLLGSRNQIARAVLNPNPMDFCTKDLLTTTSERIIAYLRDFNE 
DQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL 
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SEQ 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X-Unknown , *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LTENQRKGHSDENSNAKIKQNRVLVCAPSNAAVDELMXKIILEF 

MKKELPSHVQAMHKRKEFLDYQLDELSRQRALCRGGREIQRQEL 
DENISKVSKERQELASKIKEVQGRPQKTQSIIILESHIICCTLS 
TSGGLLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCN 
KLILVGDPKQLPPTVISMKAQEYGYDQSMMAHFCRLLEENVEHN 
MISRLPILQLTVQYRMHPDICLFPSNYVYNRNLKTNRQTEAIRC 
SSDWPFQPYLVFDVGDGSERRDNDSYINVQEIKLVMEIIKLIKD 
KRKDVS FRNIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDA 
FQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLF 
ILGHLRTLMENQHWNQL IQDAQKRGAI I KTCDKNYRHDAVKI LK 
LKPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGFAKTSVAASL 
YHTPSDSKE I TLTVTS KDPERP P VHDQLQDPRLLKRMG I EVKGG 
IFLWDPQPSSPQHPGATPPTGEPGFPWHQDLSHVQQPAAWAA 
LSSHKPPVRGEPPAASPEASTCQSKGDDPEEELCHRREARAFSE 
GEQEKCGSETHHTRRNSRWDKRTLEQEDSSSKKRKLL 


S392 


972 


186 


,11 , r LLi.<< S SMAHRLQ IRLLTWDVKDTTiLRLRHPLGEAYATKAR 

VLQTFHLAGVQDAQAVAPIAEQLYKDFSHPCTWQVLDGAEDTLR 
ECRTRGLRLAVISNFDRRLEGILGGLGLREHFDFVLTSEAAGWP 
KPDPRIFQEALRLAHMEPWAAHVGDNYLCDYQGPRAVGMHSFL 
WGPQALDPWRDSVPKEHILPSLAHLLPALDCI,EGSTPGL 


S3 93 


2017 


730 


TGGSKMAAVATCGSVAASTGSAVATASKSNVTSFQRRGPRASVT 
NDSGPRLVSIAGTRPSVRNGQLLVSTGLFALDQLLGGGLAV3TV 

ELPAPLLDDKCKKEFDEDVYNHKTPESNIKMKIAWRYQLLPKME 
IGPVSSSRFGHYYDASICRMPQELIEASNWHGFFLPEKISSTLKV 
EPCSLTPGYTKLLQFIQNIIYEEGFDGSNPQKKQRNILRIGIQN 
LGSPLWGDDICCAENGGNSHSLTKFLYVLRGLLRTSLSACIITM 
PTHLIQNKAIIARVTTLSDWVGLESFIGSERETNPLYKDYHGL 
IHIRQIPRLNNLICDESDVKDLAFKLKRKLFTIERLHLPPDLSD 
TVSRSSKMDLAESAKRLGPGCGMMAGGKKHLDF 


6394 


1418 


511 


GAAAGGEGARRRPAAMATVMAATAAERAVLEEEFRWLLHDEVHA 
VT.KQTiQDILKEASLRFTLPGSGTEGPAKQENFILGSCGTDQVKG 
VLTLQGDALSQADVNLKMPRNNQLLHFAFREDKQWKLQQIQDAR 
NHVSQAIYLLTSRDQSYQFKTGAEVLKLMDAVMLQLTRARNRLT 
TPATLTLPEIAASGLTRMFAPALPSDLLVNVYINLNKLCLTVYQ 
LHALQPNSTKNFRPAGGAVLHSPGAMFEWGSQRLEVSHVHKVEC 
VIPWLNDALVYFTVSLQLCQQLKDKISVFSSYWSYRPF 


S395 


13 




ESVTFEDVAVEFIQEWALLDSARRSLCKYRMLDQCRTLASRGTP 
PCKPSCVSQLGQRAEPKATERGILRATGVAWESQLKPEELPSMQ 
DLLEEASSRDMQMGPGLFLRMQLVPSIEERETPLTREDRPALQ3 
PPWSLGCTGLKAAMQ1QRWIPVPTLGHRNPWVARDSGE 


S39S 




1221 


ANILS SPSKRGQKGTLIGYSPEGTPLYWFMGDAFQHSSQSIPRF 
IKESLKQILEESDSRQIFYFLCLNLLFTFVELFYGVLTNSLGLI 
SDGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYGRIEILSGF 
INGLFLIVIAFFVFMESVARLIDPPELDTHMLTPVSVGGLIVNL 
IGICAFSHAHSHAHGASQGSCHSSDHSHSHHMHGHSDHGHGHSH 
GSAGGGMNANMRGVFLHVLADTLGSIGVIVSTVLIEQFGWFIAD 
PLCSLFIAILIFLSWPLIKDACQVLLLRLPPEYEKELHIALEK 
IQKIEGLISYRDPHFWRHSASIVAGTIHIQVTSDVLEQRIVQQV 
TGILKDAGWNLTIQVEKEAYFQHMSGLSTGFHDViAMTKQMES 
MKYCKDGTYIM 


6397 391 


122 


GAGGVGRFEAIRAPARMIEWCNDRLGKKVRVKCNTDDTIGDLK 
KLIAAQTGTRWNKIVLKKWYTIFKDHVSLGDYEIHDGMNLELYY 



498 



